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Introduction 


Estrogen  Receptor  modulators,  such  as  Tamoxifen,  represent  one  of  the  most  accepted  therapy 
for  the  prevention  and  treatment  of  breast  cancer.  However,  the  resistance  of  estrogen  receptor¬ 
negative  tumor  cells  to  treatment,  and  the  possibility  that  Tamoxifen  may  increase  the  chances  of 
endometrial  carcinoma,  create  the  need  for  improved  therapeutic  strategies.  The  Retinoid 
Receptors,  two  other  members  of  the  Nuclear  Receptors  (NR)  family,  each  composed  of  three 
isoforms  (a,  P  and  y),  have  been  recognized  for  many  years  as  targets  for  breast  cancer  therapy 
(Thacher  et  al  for  review).  Retinoids  exert  their  anticancer  activities  both  in  estrogen  receptor¬ 
positive  and  -negative  cells,  mainly  through  their  abilities  to  modulate  the  growth, 
differentiation,  and  apoptosis  of  tumor  cells.  Although  retinoids  are  thought  to  have  great 
therapeutic  potential,  their  clinical  use  is  so  far  limited,  because  of  toxic  side  effects  (Rizvi  et 
al.).  Consequently,  the  identification  of  novel  structural  motifs  of  small  molecule  modulators  of 
the  retinoid  receptors  is  critical  for  the  development  of  novel  ligands  with  improved  toxicity 
profiles,  which  could  lead  the  way  towards  a  novel  generation  of  drugs  against  breast  cancer. 

Our  goal  is  to  use  state-of-the-art  computer  modeling  technologies  to  develop  such  compounds. 

Body 

Task  1:  Identify  the  structural  determinants  which  confer  a  ligand  retinoid  X  receptor  (RXR)  or 
retinoic  acid  receptor  (RAR)  selectivity 

As  detailed  in  the  previous  annual  report,  we  were  able  to  build  a  model  of  the  RXR  ligand 
binding  domain  (RXR-LBD)  bound  to  9-cis  retinoic  acid  (9-cisRA),  and  docked  RXR  and  RAR 
ligands  to  the  two  receptors.  We  predicted  that  the  ligand  binding  pocket  of  RXR  is  shorter  and 
bulkier  than  that  of  RAR,  which  accounts  for  receptor  selectivity.  Our  prediction  was  recently 
confirmed  by  the  crystal  structure  of  the  RXR/9cis  RA  complex  (Gampe  et  al.). 

As  previously  described,  we  also  docked  RXR-  and  RAR  selective  ligands  into  the  receptors 
pocket  and  showed  that  our  models  correlated  with  experimental  observations:  non-binders  had  a 
positive  calculated  Van  der  Waals  energy  of  interaction,  whereas  binders  had  a  negative  Van  der 
Waals  energy  of  interaction. 

An  increasing  amount  of  data  suggest  that  the  RARa  isoform,  which  controls  the  expression  of 
RARP,  is  the  most  relevant  target  for  breast  cancer  therapy  (Liu  et  al.,  Widschwendter  et  al.).  A 
detailed  understanding,  at  the  atomic  level,  of  the  determinants  for  isoform  specificity  is 
therefore  important  to  design  ligands  with  appropriate  receptor  selectivity.  We  docked  a  RARa 
specific  ligand,  Am580,  in  the  crystal  structure  of  RAR  y,  and  in  a  model  that  we  built  of  RARa 
and  RARP,  and  showed  that  residue  234,  one  of  the  three  residues  of  the  ligand  binding  pocket 
which  are  not  conserved  among  the  three  RAR  isoforms,  was  responsible  for  receptor  isoform 
specificity  (Figure  1). 


Task  2:  Identify  the  structural  properties  a  ligand  must  satisfy  to  be  an  agonist 

The  structural  mechanism  of  agonist  and  antagonist  activity  was  recently  revealed  by  a  series  of 
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crystal  structures  (Moras  and  Gronemeyer  for  review).  Upon  binding  of  an  agonist,  the  C- 
terminal  helix  12  of  the  LBD  undergoes  a  conformational  change,  closes  like  a  lid  on  the  ligand, 
and  forms  a  hydrophobic  pocket  on  the  surface  of  the  receptor.  This  pocket  is  the  target  for 
several  known  and  unknown  coactivator  proteins,  which  bind  the  LBD  through  a  conserved 
LxxLL  motif.  In  the  antagonist-bound  conformation,  the  helix  HI 2  is  pushed  away  from  the 
ligand  binding  pocket,  and  occupies  the  coactivator  binding  site,  preventing  coactivator 
recruitment. 

As  described  in  our  previous  report,  we  collaborated  with  Dr  Herbert  Samuel’s  lab,  at  NYU 
Medical  Center,  to  identify  a  novel  coactivator,  NRIF3,  mediating  functional  specificity  of 
nuclear  hormone  receptors.  This  novel  protein  coactivates  RXR  but  not  RAR.  Our  modeling 
suggests  that  NRIF3  binds  to  liganded  receptors  through  an  LxxIL  module  contained  within  the 
C-terminal  domain  of  the  coactivator.  This  data  was  the  object  of  an  article  published  in 
Molecular  and  Cellular  Biology  and  is  appended  to  this  report  (Article  1). 

Very  interesting  data  recently  published  on  the  structure-activity  relationship  of  another  nuclear 
receptor,  the  estrogen  receptor,  encouraged  us  to  initiate  an  additional  avenue  of  investigation  on 
RAR,  with  possible  important  applications  in  breast  cancer  therapy.  Paige  et  al.  showed  that 
different  agonists  of  the  estrogen  receptor  (estradiol,  estriol,  and  diethylstilbestrol)  could  induce 
different  conformational  changes  of  the  receptor  LBD,  resulting  in  recruitment  of  different 
coactivators  (Paige  et  al.).  Structural  analysis  of  many  nuclear  receptors  suggests  that,  upon 
ligand  binding,  the  core  of  the  receptor  LBD  remains  fixed,  while  the  C-terminal  H12  helix  is 
relocated  and  interacts  with  the  ligand,  thus  generating  a  binding  site  for  coactivator  proteins 
(Moras  and  Gronemeyer,  L.  Freeman).  It  is  therefore  reasonable  to  predict  that  the  different 
estrogen  receptor  agonists  induce  a  different  rearrangement  of  the  H12  helix. 

We  built  low  energy  models  of  the  RAR  LBD  where  the  H12  helix  is  relocated  onto  different 
docking  sites  at  the  surface  of  the  receptor.  Only  a  limited  number  of  rearrangements  of  H12  are 
possible  (Figure  2).  We  plan  to  apply  our  virtual  screening  technology  to  these  structures  (see 
Task  3),  in  order  to  discover  novel  RAR  agonists  that  could  induce  the  recruitment  of  different 
coactivators.  Such  molecules  could  have  a  different  pharmacological  profile,  compared  with 
existing  RAR  ligands. 


Task  3:  Design  rationally  and  test  novel  drugs  against  breast  cancer 

1-Use  our  model  to  screen  libraries  of  compounds  and  select  ligands  with  desired  structural 
characteristics 

As  described  in  our  previous  report,  we  have  built  a  model  of  antagonist-bound  RAR  and  carried 
out  a  high-throughput  computer  screening  where  over  150,000  ligands  were  automatically 
docked  into  the  receptor’s  binding  pocket  and  assigned  a  score  according  to  the  quality  of  the  fit. 
About  thirty  RAR  antagonist  candidates  were  selected  for  in  vitro  assay.  This  work  was 
published  in  February  in  the  Proceedings  of  the  National  Academy  of  Sciences  and  is  appended 
to  this  report  (Article  2). 

We  have  also  used  a  benchmark  virtual  library  made  of  the  structure  of  known  ligands  for 
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different  members  of  the  nuclear  receptor  family  in  order  to  optimize  our  virtual  library 
screening  technology.  We  could  show  that  after  four  independent  screenings,  all  the  RAR 
ligands,  and  only  the  RAR  ligands  were  selected  (Table  1).  This  work  was  also  published 
(Article  2). 

Additionally,  we  have  used  the  crystal  structure  of  agonist-bound  RAR  to  screen  our  virtual 
library  of  over  150,000  ligands  and  selected  about  30  agonist  candidates  to  be  tested  in  vitro. 

This  work  was  recently  accepted  for  publication  with  revisions  at  the  Journal  of  Medicinal 
Chemistry,  and  is  appended  to  this  report  (Article  3). 

Finally,  we  have  used  our  model  of  agonist-bound  RXR  to  screen  our  virtual  compound  library. 
However,  we  became  aware  during  the  screening  that  the  crystal  structure  of  the  RXR-LBD/9cis 
retinoic  acid  complex  was  about  to  be  published  (R  Gampe  et  al.).  As  a  result,  we  postponed  the 
in  vitro  testing  of  the  RXR  agonist  candidates,  and  waited  for  the  coordinates  of  the  crystal 
structure  of  the  receptor  to  carry  out  a  novel  screening  which  should  result  in  an  improved 
selection  of  ligands. 

2-  Test  these  ligands  in  vitro  for  their  receptor  selectivity,  and  their  ability  to  trigger 
transactivation 

As  described  in  our  previous  report,  two  novel  RAR  antagonists  were  discovered,  thus  validating 
our  modeling  strategy  (Article  2). 

We  also  discovered  novel  agonists  of  RAR  (Article  3).  One  of  them,  agonist  3,  presents  very 
original  structural  features:(i)  a  penta-methylated  benzene  linked  by  a  ketone  to  a  second 
aromatic  ring,  forms  a  very  large  hydrophobic  head,  (ii)  A  pyridine  probably  makes  a  strong 
interaction  with  Cys  237  and  replaces  a  carboxylate  or  ester  present  in  all  RAR  ligands  described 
so  far  (Figure  3).  These  novel  structural  features  may  translate  into  improved  toxicity  profiles 
and  result  in  the  development  of  new  ligands  for  breast  cancer  therapy. 

This  last  finding  is  extremely  encouraging,  and  we  initiated  a  collaboration  with  Pr.  Stephen 
Wilson  and  Pr.  Herbert  Samuels  to  discover  analogs  of  agonist  3  with  improved  activity.  Letters 
of  intent  from  Drs.  Wilson  and  Samuels  are  appended.  Dr  Wilson  is  Professor  of  chemistry  at 
New  York  University  and  the  director  of  the  New  York  University  Laboratory  for  Combinatorial 
Chemistry.  He  is  a  renown  expert  in  solid  phase  chemistry,  and  the  author  of  Combinatorial 
Chemistry:  Synthesis  and  Application,  John  Wiley  &  Sons,  March  1997;  edited  by  Wilson  & 
Czamik,  one  of  the  first  books  on  combinatorial  chemistry  and  its  organic  chemistry 
applications.  Pr.  Samuels  is  an  internationally  recognized  expert  in  endocrinology,  and  has  been 
collaborating  with  us  since  the  beginning  of  our  research  effort.  Dr  Matthieu  Schapira,  under  Pr. 
Ruben  Agabyan’s  supervision,  coordinates  this  collaboration. 

RAR  ligands  described  in  the  literature  belong  to  limited  series  of  related  structures,  and  display 
similar  toxic  side  effects.  The  discovery  of  agonist  3  and  the  model  of  its  interaction  with  RAR 
open  an  avenue  towards  new  RAR  agonist  ligands  with  possibly  more  favorable  specificity  and 
toxicity  profiles.  We  are  anxious  to  build  on  our  findings  and,  with  the  help  of  chemists  and 
biologists,  design,  synthesize  and  test  analogs  of  this  novel  structure  with  improved  activity  and 
specificity.  This  work  could  have  direct  applications  in  breast  cancer  therapy. 
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3-  Test  these  ligands  in  vivo  for  their  efficacy  against  rat  mammary  tumor 

We  plan  to  test  in  vivo  the  ligands  discovered  in  collaboration  with  Dr.  Herbert  Samuels. 


Key  Research  Accomplishments 

-We  built  a  model  of  the  active  conformation  of  RXR,  which  was  recently  confirmed  by  a  crystal 
structure,  and  could  derive  the  structural  differences  between  the  RAR  and  RXR  binding  pockets 
responsible  for  ligand  selectivity  (as  illustrated  in  the  previous  annual  report). 

-We  provided  structural  determinants  for  RAR  isoforms  selectivity  (Figure  1). 

-We  modeled  the  interaction  between  RAR  or  RXR  and  NRIF3,  a  novel  nuclear  receptor 
coactivator  (Article  1). 

-We  built  predictive  models  of  novel  conformations  of  active  RAR  and  plan  to  use  these  models 
to  derive  novel  types  of  agonists  (Figure  2). 

-We  discovered  novel  RAR  antagonists  (Article  2). 

-We  optimized  our  virtual  screening  technology  and  demonstrated  its  efficacy  at  discriminating 
between  binders  and  non-binders  (Table  1  and  Article  2). 

-We  discovered  an  RAR  agonist  with  very  original  structural  features,  and  initiated  a 
collaboration  with  chemists  (Pr.  S.  Wilson’s  laboratory)  and  biologists  (Pr.  H.  Samuels’ 
laboratory)  to  design,  synthesize  and  test  structural  analogs  of  this  very  promising  molecule 
(Figure  3  and  Article  3). 


Reportable  Outcomes 

Article  1: 

NRIF3  is  a  novel  co-activator  mediating  functional  specificity  of  nuclear  hormone  receptors 
D.  Li,  V.  Desai-Yajnik,  E.  Lo,  M.  Schapira,  R.A.  Abagyan,  and  H.H.  Samuels 
Molecular  and  Cellular  Biology  1999  Oct;19(10):7191-7202 

Article  2: 

Rational  discovery  of  novel  nuclear  receptor  antagonists 
M.  Schapira,  B.  Raaka,  H.H.  Samuels,  and  R.A.  Abagyan 

Proceedings  of  the  National  Academy  of  Sciences  USA.  2000  Feb  1  ;97(3):  1008-13 
Article  3: 

In  Silico  discovery  of  novel  retinoic  acid  receptor  agonist  structures 
M.  Schapira,  B.  Raaka,  H.H.  Samuels,  and  R.A.  Abagyan 

Accepted  for  publication  with  revisions  to  Journal  of  Medicinal  Chemistry  April  2000 

A  patent  filing  application  was  sent  to  the  Office  of  Industrial  Liaison  of  New  York  University 
School  of  Medicine,  to  protect  our  discovery  of  Agonist  3. 
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Conclusions 

During  the  first  two  years  of  our  effort,  we  have  accomplished  most  of  the  tasks  outlined  in  the 
approved  Statement  of  Work.  We  have  identified  the  structural  determinants  which  confer  a 
ligand  RXR  or  RAR  selectivity,  and  successfully  challenged  our  models  with  known  ligands  for 
either  receptor.  We  have  also  put  a  rational  on  RAR  isoforms  specificity  which  will  be  useful  to 
develop  RARa  specific  agonists  -the  biological  activity  which  seems  the  most  promising  for 
breast  cancer  therapy. 

Based  on  recent  structural  data  on  the  estrogen  receptor,  we  also  produced  models  of  distinct 
RAR  conformations,  which  we  will  use  to  derive  novel  types  of  ligands  for  this  receptor.  Such 
ligands  could  have  interesting  activities,  mirroring  the  complexity  of  small  molecule  estrogen 
receptor  modulators. 

Finally,  we  have  developed  a  strategy  to  rationally  design  RAR  antagonists,  and  were  also  able 
to  discover  novel  RAR  agonists  with  very  original  structural  features.  Our  current  effort  to 
develop  analogs  of  these  novel  molecules  could  result  in  the  identification  of  leads  against  breast 
cancer. 

"So  what": 

We  have  developed  a  series  of  approaches  to  identify  new  antagonists  and  new  agonists  for 
the  retinoid  receptors,  based  on  the  structure  of  the  receptor,  and  regardless  of  existing 
compounds.  This  original  strategy  already  resulted  in  the  discovery  of  active  ligands  presenting 
very  original  features.  We  are  currently  working  in  collaboration  with  chemists  and  biologists  to 
develop  analogs  of  these  molecules  with  improved  activity  and  specificity,  which  would 
constitute  valuable  leads  for  breast  cancer  therapy. 
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Figure  1:  Docking  of  a  known  RARa  specific  agonist.  The 
RARa  selective  agonist  Am580  was  docked  into  the  modeled 
ligand  binding  pocket  of  RARa.  A:  The  ligand  (white 
sticks)  superimposes  with  the  crystal  structure  of  bound  all- 
trans  RA  (green).  Hydrogens  are  not  shown  for  clarity.  B: 
Am580  (CPK  display)  fits  tightly  into  the  receptor’s  pocket 
(yellow  wire),  except  for  a  ketone  oxygen,  which  does  not 
clash  with  the  receptor  only  because  it  can  share  an  hydrogen 
with  Ser234  (displayed  as  stick).  In  RAR(3  and  y,  Ser234  is 
replaced  by  an  alanine  which  clashes  with  the  ketone  oxygen. 
The  receptor  in  the  vicinity  of  the  ligand  is  shown  as  a  white 
ribbon.  Carbons,  hydrogens,  oxygens  and  nitrogen  are  colored 
white,  grey,  red  and  blue  respectively. 
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Figure  2:  Models  of  RAR  LBD  where  the  C-terminal  helix  HI 2  (green,  magenta)  is 
relocated  onto  different  docking  sites  of  the  receptor  (surface  representation)  could 
provide  novel  binding  pocket  conformations  for  the  structure  based  discovery  of 
agonists  (cyan)  with  improved  pharmacological  profile. 
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Ligand 

Activity  Scorel 

Score2 

Score3 

Score4 

Selected 

Binding 

AGN  193836RARa  agonist  -37.2 

-36.5 

-36.7 

-35.3 

+ 

+ 

ATRA 

RAR  pan-agonist  -5 1 .7 

-52.6 

-51.8 

-52.0 

+ 

+ 

Ro4 15253 

RARa  antagonist  -28.9 

-24.4 

-39.0 

-46.6 

+ 

+ 

MX781 

RAR  antagonist  -45.3 

-48.0 

-40.2 

-45.6 

+ 

+ 

CD2366 

RAR  pan-  7 

antagonist. 

-50.8 

-29.3 

-29.3 

+ 

+ 

Targretin 

RXR  pan-agonist  -25.4 

-23.0 

-22.2 

-31.0 

- 

- 

SRI  1203 

RXR  pan-agonist  -28.2 

-22.7 

-22.1 

-27.5 

- 

- 

Tamoxifen 

ER  modulator  -26.4 

-24.6 

-30.3 

-23.4 

- 

- 

Raloxifene 

ER  modulator  -15.6 

-23.7 

-18.4 

-17.4 

- 

- 

RU486 

Progest  Rec  antag.  -21.4 

-20.6 

-20.3 

-20.1 

- 

- 

9cisRA 

RAR/RXR  3g  g 

agonist. 

-39.5 

-33.5 

-38.7 

+ 

+ 

AGN  1 931 09 pan'  -55.1 

antagonist. 

-55.5 

-41.2 

-54.8 

+ 

+ 

AGNpartia 

RAR  partial  ^  ^ 

agonist. 

-61.3 

-61.4 

-61.0 

+ 

+ 

Am580 

RARa  agonist  -46.6 

-47.2 

-46.6 

-46.5 

+ 

+ 

EM652 

ER  antagonist  -26.3 

-23.1 

-23.7 

-27.3 

- 

- 

Antagonist! 

Novel  RAR  antag  -32. 1 

-32.1 

-31.7 

-31.6 

+ 

+ 

Antagonist2  Novel  RAR  antag  -33.3 

-29.7 

-33.8 

-33.8 

+ 

+ 

Table  1:  Control  screening  of  known  ligands.  A  similar  screening  as  the  one  performed  on  the 
large  library  was  carried  out  four  times  on  a  small  database  made  of  known  nuclear  receptor 
ligands:  the  compounds  were  automatically  docked  in  the  structure  of  RAR.  The  molecules 
which  met  at  least  once  with  the  criteria  for  selection  are  listed  as  "Selected".  The  ligands  which 
are  experimentally  binding  to  RAR  are  listed  as  "Binding". 
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Figure  3:  Unlike  all  retinoid  receptor  ligands  described  in  the  literature.  Agonist  3  (green) 
does  not  have  a  carboxylate,  but  a  pyridine  which  makes  a  thiolate  salt  with  a  cystein  (white) 
of  the  receptor,  and  could  also  interact  with  a  neighboring  arginine  (white).  This  compound  is 
active  at  200  nM,  and  the  rational  design  of  analogs,  which  conserve  this  original  feature  could 
result  in  leads  against  breast  cancer  with  improved  specificity  and  toxicity  profiles.  We 
embarked  on  the  development  of  such  molecules  in  collaboration  with  renowned  chemists  and 
biologists. 
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tor  proteins  primarily  belong  to  two  groups:  the  SRC-1  family 
and  the  CREB-binding  protein  (CBP)/p300  family.  The  SRC-1 
family  includes  SRC-l/NCoA-1  (37,  58,  74)  and  the  related 
proteins  GRIP17TIF2/NCoA-2  (34,  35,  74,  79)  and  AIBl/p/ 
CIP/ACTR/RAC3/TRAM-1  (2,  14,  44,  73,  74).  The  second 
group  of  coactivators  includes  CBP  and  its  homolog  p300, 
which  not  only  influence  the  activities  of  nuclear  receptors  (13, 

31  37)  but  also  functionally  interact  with  many  transcription 
factors  such  as  CREB  (3, 16, 40, 46),  the  Stats  (10,  87),  API  (4, 

7),  and  p53  (28, 45).  There  are  also  coactivator  proteins  that  do 
not  belong  to  these  two  groups,  such  as  ARA70  (85),  PGC4 
(60),  and  the  recently  reported  RNA  coactivator  SRA  (41). 
Members  of  both  the  SRC-1  family  and  CBP/p300  family  have 
been  shown  to  possess  histone  acetyltransferase  activities  (8, 
14,  57,  69),  suggesting  that  chromatin  remodeling  by  histone 
acetylation  is  an  important  mechanism  involved  in  transcrip¬ 
tional  activation  by  ligand-bound  nuclear  receptors. 

Interaction  of  members  of  the  SRC-1  and  CBP/p300  families 
with  nuclear  receptors  occurs  through  conserved  LXXLL  mo¬ 
tifs  (32),  which  interact  with  a  hydrophobic  cleft  in  the  receptor 
LBD  formed  as  a  result  of  conformational  changes  mediated 
by  ligand  binding  (19,  23,  56).  SRC-l/NCoA-1  and  GRIP1/ 
TIF2  contain  three  LXXLL  regions  or  boxes  (referred  to  as 
LXDs  or  nuclear  receptor  boxes)  that  differentially  interact 
with  nuclear  receptors  so  that  different  nuclear  receptors  func¬ 
tionally  utilize  different  LXXLL  boxes  (19,  52).  Thus,  ER  uses 
the  second  LXXLL  box  of  SRC-l/NCoA-1  while  PR  uses  both 
the  first  and  second  LXXLL  boxes  for  optimal  interaction.  In 
contrast,  TR  and  RAR  require  both  the  second  and  third 
LXXLL  boxes  for  optimal  interaction  (52).  The  specificities  of 
receptor  recognition  by  the  different  LXXLL  boxes  of  SRC-1/ 
NCoA-1  are  primarily  mediated  by  8  amino  acid  residues  C 
terminal  to  the  LXXLL  motif  rather  than  by  the  2  amino  acids 
(XX)  within  the  motif  itself.  Thus,  while  members  of  the 
SRC-1  family  are  capable  of  interacting  with  many  nuclear 
receptors,  the  molecular  details  of  such  interactions  differ  for 
each  receptor  in  the  number  or  combination  of  LXXLL  boxes 
used  as  well  as  in  the  critical  amino  acid  residues  surrounding 
the  LXXLL  motifs. 

While  much  has  been  learned  from  the  study  of  known 
coactivators,  a  number  of  key  mechanistic  questions  remain  to 
be  answered.  For  example,  many  nuclear  receptors  can  recog¬ 
nize  common  DNA  elements,  (25,  49,  51),  while  not  all  are 
capable  of  regulating  genes  containing  those  elements  (20, 47, 
65).  Thus,  how  native  target  genes  containing  such  elements 
are  selectively  regulated  by  specific  receptors  is  a  very  impor¬ 
tant  but  poorly  understood  problem.  Although  the  various 
LXXLL  boxes  of  SRC-1  and  GRIP1  show  differential  receptor 
preference  (19,  52),  these  coactivators  are  unlikely  to  play  a 
primary  role  in  mediating  effects  that  are  receptor  specific 
since  they  appear  to  interact  with  all  ligand-bound  nuclear 
hormone  receptors.  Thus,  the  detailed  molecular  mecha¬ 
nism^)  underlying  receptor-specific  regulation  of  gene  expres¬ 
sion  remains  to  be  elucidated.  Whether  a  coactivator(s)  con¬ 
tributes  to  this  specificity  is  currently  unknown. 

To  further  our  understanding  of  the  molecular  events  un¬ 
derlying  receptor-activated  transcription,  we  sought  to  identify 
additional  coactivators  using  a  yeast  two-hybrid  screening 
strategy  (29).  In  this  paper,  we  report  the  isolation  of  a  novel 
coactivator  for  nuclear  receptors,  designated  NRIF3  (for  nu¬ 
clear  receptor-interacting  factor  3).  Fluorescence  microscopy 
indicates  that  NRIF3  is  a  nuclear  protein.  The  yeast  two-hybrid 
and  in  vitro  binding  assays  revealed  that  NRIF3  interacts  spe¬ 
cifically  with  TR  and  RXR  in  a  ligand-dependent  fashion  but 
does  not  interact  with  other  examined  nuclear  receptors. 
Transfection  experiments  indicated  that  NRIF3  selectively 


potentiates  TR-  and  RXR-mediated  transactivation  in  vivo. 
The  NRIF3  gene  encodes  a  small  protein  of  177  amino  acids 
and,  other  than  having  an  N-terminal  LXXLL  motif,  has  no 
homology  with  known  coactivator  genes.  The  results  of  a  com¬ 
bination  of  computer  modeling  and  domain  and  mutagenesis 
analyses  suggest  that  NRIF3  interacts  with  nuclear  receptors 
through  its  C-terminal  domain  that  contains  a  novel  LXXIL 
module  while  another  part  of  NRIF3  may  contribute  to  its 
observed  receptor  specificity.  These  findings  may  provide  novel 
insight  into  the  molecular  mechanism(s)  of  receptor-mediated 
transcriptional  activation  as  well  as  the  functional  specificities 
of  nuclear  receptors. 


MATERIALS  AND  METHODS 

Isolation  of  NRIFs  and  the  yeast  two-hybrid  assay.  The  Brent  two-hybrid 
system  (29)  was  employed  to  isolate  candidate  cDNA  clones  interacting  with 
LexA-TRa  in  a  ligand-dependent  fashion.  Full-length  chicken  TRa  (cTRa)  was 
fused  in  frame  to  the  C  terminus  of  the  LexA  DNA  binding  domain  (DBD)  in 
pEG202  (29).  The  LexA-TRa  bait,  the  LacZ  reporter  (pSH18-34),  and  a  pJG4- 
5-based  HeLa  cell  cDNA  library  were  transformed  into  the  yeast  strain  EGY48 
(29).  The  transformants  were  selected  on  Gal-Raf-X-Gal  (5-bromo-4-chloro-3- 
indolyl-p-D-galactopyranoside)  medium  in  the  absence  of  leucine  and  were  fur¬ 
ther  screened  for  the  expression  of  LacZ  in  the  presence  of  1  pM  T3.  Blue 
colonies  were  picked  and  reexamined  for  T3-dependent  expression  of  LacZ. 
Positive  yeast  clones  were  then  selected,  and  plasmids  harboring  candidate  prey 
cDNAs  were  isolated.  An  individual  candidate  prey  plasmid  was  then  amplified 
in  Escherichia  coli  and  retransformed  into  the  original  yeast  strain  to  confirm  the 
interaction  phenotype.  The  cDNA  inserts  were  then  sequenced  with  an  auto¬ 
matic  sequencer.  Four  novel  clones  (NRIF1,  -2,  -3,  and  -4)  were  obtained. 
Among  them,  NRIF3  was  a  full-length  clone. 

Wild-type  NRIF3,  the  03-endonexin  long  form  (EnL)  and  short  torm  (fcnb), 
and  the  L9A  NRIF3  mutant  protein  were  examined  for  their  interaction  with 
various  nuclear  receptors  in  a  yeast  two-hybrid  assay.  The  following  receptor 
baits  were  used:  the  LexA-cTRa  LBD,  LexA-human  TR(3  (hTRp)  LBD,  LexA- 
hRARa  LBD,  LexA-hRXRa  LBD,  and  LexA-hGR  LBD.  The  NRIF3  C-termi- 
nal  domain  (NCD)  was  fused  in  frame  with  the  LexA  DBD  and  examined  for 
interaction  with  receptor  LBDs  with  the  following  preys:  the  B42-cTRa  LBD, 
B42-hRARa  LBD,  and  B42-hRXRa  LBD  expressed  from  pJG4-5.  Yeast  cells 
harboring  appropriate  plasmids  were  grown  in  selective  media  with  Gal-Raf  in 
the  presence  or  absence  of  cognate  ligand  (1  pM  T3  for  TR,  all  trans  or  9 -as  RA 
for  RAR,  9-cis  RA  for  RXR,  and  10  pM  deoxycorticosterone  for  GR)  overnight 
before  (3-galactosidase  activity  was  assayed  with  o-nitrophenyl  (S-D-galactopyr- 
anoside  as  the  substrate.  (3-Galactosidase  units  were  calculated  with  the  formula 
(OD420  x  l,000)/(minutes  of  incubation  x  OD^  of  yeast  suspension),  where 
OD420  and  OD600  are  the  optical  densities  at  420  and  600  nm,  respectively. 

Fluorescence  microscopy.  Full-length  NRIF3  was  cloned  into  the  green  fluo¬ 
rescent  protein  (GFP)  fusion  protein  expression  vector  pEGFP  (Clontech).  The 
resulting  GFP-NRIF3  vector  and  the  control  plasmid  pEGFP  were  transfected 
into  HeLa  cells  by  calcium  phosphate  coprecipitation.  Cells  were  incubated  at 
37°C  for  24  h  before  the  examination  with  a  fluorescence  microscope  to  deter¬ 
mine  the  subcellular  location  of  GFP-NRIF3  or  the  GFP  control. 

In  vitro  binding  assay.  Full-length  NRIF3  was  cloned  into  pGEX2T,  a  bacte¬ 
rial  glutathione  S- transferase  (GST)  fusion  protein  expression  vector  (Pharma¬ 
cia).  The  GST-NRIF3  fusion  protein  was  expressed  in  E.  coli  and  affinity  purified 
with  glutathione-agarose  beads  (30).  35S-labeled  full-length  cTRa,  hRARa, 
hRXRa  hVDR,  hGR,  hPR,  and  hER  were  generated  by  in  vitro  transcription 
and  translation  with  a  reticulocyte  lysate  system  (Promega).  Binding  was  per¬ 
formed  as  previously  described  (30)  with  the  following  buffer:  20  mM  HEPhb 
(pH  7.9)-l  mM  MgCl2-l  mM  dithiothreitol-10%  glycerol-0.05%  Triton 
X-100-1  pM  ZnCl2-150  mM  KC1.  Appropriate  ligands  were  added  into  the 
binding  reaction  mixture  where  indicated  in  the  figures  in  the  following  concen¬ 
trations:  1  pM  T3  for  TR,  1  pM  all -trans  RA  or  9-cis  RA  for  RAR,  1  pM  9 <ts 
RA  for  RXR,  and  150  nM  l,25-(OH)rvitamin  D3,  dexamethasone,  progester¬ 
one,  or  estradiol  for  VDR,  GR,  PR,  or  ER,  respectively.  After  the  binding 
reaction,  the  beads  were  washed  three  times  and  the  labeled  receptors  bound  to 
the  beads  were  examined  by  sodium  dodecyl  sulfate-10%  polyacrylamide  ge 
electrophoresis  followed  by  autoradiography.  Five  percent  of  the  S-labeled 
receptor  input  was  also  electrophoresed  in  the  same  gel. 

Transfection  studies.  Most  reporters  used  in  this  study,  including  IR-AMTV- 
CAT  DR4-AMTV-CAT,  GH-TRE-tk-CAT,  and  IR+3  (ERE)-AMTV-CAT, 
have  been  described  previously  (5,  25,  78).  A  DR1-AMTV-CAT  reporter  re¬ 
sponsive  to  RXR  was  obtained  from  Ron  Evans.  A  GRE/PRE-tk-CAT  reporter 
was  obtained  from  Gunther  Schutz.  (!R)2-TATA-CAT  was  constructed  in  our 
laboratory  by  cloning  two  copies  of  the  inverted-repeat  (IR)  sequence  (AGG 
TCA  TGACCT)  upstream  of  a  TATA  element  derived  from  the  thymidine 
kinase  (tk)  promoter.  An  hVDR  expression  vector  and  VDRE-AMTV-CAT 
containing  the  VDRE  from  the  osteocalcin  promoter  were  obtained  from  J. 
Wesley  Pike.  Vectors  expressing  cTRa,  hRARa,  hRXRa,  rat  GR  (rGR),  hPR, 
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and  UER  have  been  described  previously 

expression  vector  was  constructed  by  8  HeLa  cells  by  calcium 

phosphate  coprecipitation  with  25  to  100  ng ot  m  ng  of  the  NRIF3  0r 

chloramphenicol  acetytransferase  (CAT)  p  »  jncubated  at  37°C  (with  or 
control  pExpress  vector.  After  trans  ec  1  ,  harvested  CAT  assays  were 

i&xzxs- 

or  quadrup,icate 

ligated  to  an  EnL  or  EnS  insertgenera  ed  fr0™  SjG4-5^r,S  plasmid 

b yNM  double  digestion.  NRTO  was 

was  confirmed  by  sequence  analy  .  based  method,  and  the  muta- 

generated  by  site-directed  mutageneses  by  a  PCR-ba  ed  memo  ^ 

tion  was  confirmed  by  sequence  analysis.  pJG4-5  ‘  strains  har. 

EnS,  or  the  L9A  NRIF3  ^S^SStaKLds  (LexA-TR, 

Sr  SS  and  UxA-GR)  Transformants  were  subjected  to 

quantitative  assays  of  p-galactosidase  activity ^d®s'pbe  del^ative  of  pEG202 
To  construct  the  bait  plasmid  expressing  UxA-NCD h  a  w  andFligated 

(which  contains  a  new  polylinker)  was  digested  with  NgRIF3 

to  synthetic  oligonucleotides  that  encc i  e  generated  by  using  oligo- 

(residues  162  to  177).  Similarly,  the  mutantNCD  was  generatea  ^ 

nucleotides  that  contain  the  designed  mutations  in  tn  ^  expressing  LexA- 

constructs  were  confirmed  by  sequence  analys^Ba  t  plasm,  P 

NCD  “  LED,  into  the  yeast 

(pSH18-34)- Subsequent  tw°-hy  assays 

were  carried  out  as  described  above^  built  a  model  0f  the  inter- 

Docking  of  coactivator  peptides  to  ° NRIF3  (KASRHLDSYE 

action  between  the  17-residue  C-  erm  P  P  d  as  an  example  in 

s^^w^Ssrrjsssas 

docking  procedure  was  carried  ou  o  nuclear  hormone  receptors  and 

features  of  the  interaction  between  the  LBDs  of  nuclear ^nor m  ^  „ 

their  coactivators  were  used  to  build  em  •(  PPAR7  (56),  where  a 

initially  observed  in  the  complex 

conserved  glutamate  and  lysine  at  opposi 1  ivator.s  lxXLL  box.  This  feature 
receptors  contact  the  backbone  of  th  feature  was  that 

enabled  us  to  orient  the  NRIF3  helical  peptide^(n)  The  ^ 

~ °f  the  NRIF3  peptide 

Wh-^^coactlvtto7pe,ptrides  were  aKi^>e<^3^helical  secondary^  structure^the 

backbone  4>  and  angles  being  restraint^  were  set  between  the  charge 

was  set  to  180  degrees.  The  energy  of  the 

-  SS  -  T  ICNlmethod 

helix, ’and  the  side  chain  torsioni Ca,culated  by  the  parti- 
Binding  energy  calculation.  The  binding  S7  binding  energy  function 
tioning  method  as  described  elsewhere  (  )•  36  determined  as 

is  partitioned  into  three  terms:  gf  ^  cal/mo(/ 

chains  partially  or  completely  buried 

upon  complexation. 

results 

Cloning  of  NRIF3  cDNA.  To  isolate  potential  coactivators 
mediating  the  transcriptional  activation  functions  of  nuclear 
receptors!  we  employed  a  yeast  two-hybrid  screening  strategy 
(29).  A  bait  expressing  a  full-length  TRa  K  » 

UccTathyroid  hormone  (T3l-Jcpendeni  inieract'onwiihl.erA- 
TRa  were  selected  and  further  examined  and  sequenced.  Four 
™e.  d«S“  re  identified.  and  all  were  found  .o  eah.b.t 


150 

1  I 

1  II 

i  1  J-l 


Baits  Lex  A  LexA-TR  LexA-TR 

Preys  NR1F3  NRIF3  TR1P1 

r  1  Hormone-dependent  interaction  of  NRIF3  with  the  LBD  of  TR. 
Induction  of”to^dL  activity  by  thyroid  hormone  (T3)  was ^asuredm 
ItS"J„gEGY48  transformed  with 

B42-Tripl  was  used  as  the  positive  control.  Hatcn  , 
with  1  p-M  T3. 

levels  of  interaction  with  the  LBD  of  TRa  similar  to  the  levels 
they  exhibited  with  the  full-length  receptor  (data  not  sho  )• 
These  clones  were  designated  NRIF1,  -2,  -3,  an  -  . 
prisingly,  the  LBD  of  TRj3  was  also  found  to  interact  with  these 
NRIFs  to  a  T3-dependent  manner  (data  not  shown).  Amo  g 
ThSe  four  isolated" NRIFs,  NRIF3  was  a 
shown  in  Fig.  1,  LexA  alone  (negative  control)  did  not  intera 
with  NRIF3  (as  indicated  by  the  low  p-galactosidase  ac  lvity) 
and  incubation  with  T3  had  no  effect.  Similarly,  no  interaction 
™  detected  between  the  LexA-TR LBD  and  B«  alone  »,  h 
or  without  T3  (data  not  shown).  The  LexA-TR  LBD 
showed  little  interaction  with  NRIF3  in  the  absence  of  T3. 
However,  incubation  with  T3  resulted  ,n  strong 
the  NRIF3-TR  LBD  interaction  (Fig.  1).  The  extf 1 a±,° I  Li 
dependent  interaction  between  NRIF3 land  the  LexA-TR 
was  similar  to  that  of  Tripl  (Fig.  1),  one  of  the  first  TR 
interacting  factors  cloned  in  a  two-hybrid  screen  (42F 

Sequence  analysis  of  NRIF3.  Sequence  analysis  of  the  NR1F3 
cDNA  revealed  a  single  open  reading  frame  encoding  a  po  y- 
peptide  of  177  amJacids  (Fig.  2).  NRIF3  has  no  homology 
with  members  of  the  SRC-1  andCBp/p300  f* 
of  NRIF3  is  in  sharp  contrast  to  the  size  ofCBp/p300(aroun 
300  kDa)  or  of  SRC-1  family  members  (around  160  kD a)- 
NRIF3  contains  a  putative  nuclear  loca hzation  signa  l  )> 
as  well  as  one  copy  of  an  LXXLL  motif  (ammo ^ac  ds  9  to  13) 
that  was  recently  identified  as  being  essential  i :o  the *  intera 
tion  of  a  number  of  putative  coactivators  with  nuclear  recep 

,0A ‘debase  search  identified  two  highly  related  homok^  of 

NR1F3,  which  were  previously  designated  B3-< 

onri  UntJ  forms  (67)  The  endonexm  short  form  (bnb)  w 

originally  isolated  from  a  two-hybrid .screen 

factors  that  interact  with  the  cytoplasmic  ta.l  of  ^ . 

(67).  The  long  form  (EnL)  was  then  >denl>fied  as  "n  alte 

lively  spliced  product  of  the  same  gene.  However,  the  g 

form  does  not  bind  to  integrin  P3(67)' Nu?t?*‘s  or  e„l  in¬ 
comparisons  between  cDNAs  of  NRIF3  a"  ,  nr0(juCt  of  the 
dicate  that  NRIF3  is  a  third  alternatively  spliced  P^d  Qf 

same  gene  (alignment  not 

the  two  endonexin  proteins  is  under  inves  g  ( 

66a  and  see  Discussion). 
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1  CAGCGGCAGTGGTGCTTTCCCGAATCTCAGAATGCCTGTTAAAAGATCACTGAAGTTGGA 

MPVKRSLK  L  D  10 

61  TGGTCTGTTAGAAGAAAATTCATTTGATCCTTCAAAAATCACAAGGAAGAAAAGTGTTAT 

G  L  L  E  E  N  S  F  D  P  S  K  I  T  R  K  K  S  V  I  30 

121  AACTTATTCTCCAACAACTGGAACTTGTCAAATGAGTCTATTTGCTTCTCCCACAAGTTC 

TYSPTTGTCQMSLFA5PT5S  50 

181  TGAAGAGCAAAAGCACAGAAATGGACTATCAAATGAAAAGAGAAAAAAATTGAATCACCC 

EEQKHRNGLSNE  K  R  K  K  L  N  H  P  70 

241  CAGTTTAACTGAAAGCAAAGAATCTACAACAAAAGACAATGATGAATTCATGATGTTGCT 

SLTESKESTTKDNDEFMMLL  90 

301  ATCAAAAGTTGAGAAATTGTCAGAAGAAATCATGGAGAT aatgcaaaatttaagtagtat 

SKVEKLSEEIMEIMQNLSSI  110 

361  acaggctttggagggcagtagagagcttgaaaatctcattggaatctcctgtgcatcaca 

qALEGSRELENLIGISCASH  130 

421  tttcttaaaaagagaaatgcagaaaaccaaagaactaatgacaaaagtgaataaacaaaa 

FLKREMQKTKE  LMTKVNKQK  150 

481  actgtttgaaaagagtacaggacttcctcacaaagcatcacgtcatcttgacagctatga 

LFEKSTGLPHK  A.  5  R  H . 1.  D  ...$...  Y.....E..  170 

541  attccttaaagccattttaaactgaggcattaagaagaaatgcactcaccatgagcacca 

F . L....K  ...A . I.....L...N  * 

FIG.  2.  Nucleotide  and  deduced  amino  acid  sequences  of  NRIF3.  Only  part 
of  the  cDNA  sequence  is  shown.  A  putative  nuclear  localization  signal  (KRKK) 
is  underlined.  The  putative  LXXLL  motif  is  shown  with  a  double  underline. 
NRIF3  and  EnL  have  95%  identity.  They  differ  only  in  their  C  termini,  where  the 
last  16  amino  acids  (dotted  underline)  in  NRIF3  are  replaced  with  9  different 
amino  acids  (GQPQMSQPL)  in  EnL.  EnS  consists  of  111  amino  acids  and  is 
100%  identical  to  the  first  111  amino  acids  of  NRIF3  or  EnL. 


NRIF3  localizes  to  the  cell  nucleus.  Although  a  putative 
nuclear  localization  signal  was  found  in  NRIF3,  we  considered 
it  important  to  identify  the  subcellular  location  of  the  NRIF3 
protein  since  extensive  homology  was  found  between  NRIF3 
and  the  two  endonexins.  The  entire  NRIF3  open  reading 
frame  was  fused  to  the  C  terminus  of  GFP  (18).  The  resulting 
GFP-NRIF3  fusion  protein  was  expressed  in  HeLa  cells  by 
transient  transfection,  and  the  subcellular  location  of  the  fu¬ 
sion  protein  was  visualized  by  fluorescence  microscopy.  As 
shown  in  Fig.  3,  the  control  GFP  protein  distributed  through¬ 
out  the  cell  while  GFP-NRIF3  localized  exclusively  to  the 
nucleus.  This  result  suggests  that  NRIF3  is  a  nuclear  protein, 
which  is  compatible  with  its  putative  role  as  a  nuclear  receptor 
coactivator. 

Selective  interaction  of  NRIF3  with  liganded  nuclear  recep¬ 
tors  in  yeast.  Although  NRIF3  was  originally  cloned  with  full- 
length  TRa  as  the  bait,  we  later  identified  that  the  region  of 
the  receptor  responsible  for  NRIF3  binding  is  its  LBD  (Fig.  1). 
A  common  feature  among  most  of  the  known  coactivators  that 
show  ligand-dependent  interaction  with  nuclear  receptors  is 
the  presence  of  the  LXXLL  motif(s)  in  their  receptor  interac¬ 
tion  domains.  The  LXXLL  motif  appears  to  be  involved  in 
direct  contact  with  a  structurally  conserved  surface  in  the  li¬ 
gand-bound  LBDs  of  the  receptors  (23),  which  may  provide 


GFP  GFP-NRIF3 


FIG.  3.  NRIF3  is  a  nuclear  protein.  HeLa  cells  were  transfected  with  an 
expression  vector  for  GFP  (left  pane!)  or  GFP-NRIF3  (right  panel).  The  cellular 
location  of  the  expressed  proteins  was  visualized  by  fluorescence  microscopy. 


TABLE  1.  Interaction  of  NRIF3  with  nuclear  receptors  in  yeast® 


Bait 

Prey 

j3-Galactosidase  activity 

Without  With 

ligand  ligand 

Fold 

stimulation 

LexA 

NRIF3-B42 

2.3 

1.9 

0.8 

LexA-TR 

NRIF3-B42 

1.8 

125 

69 

LexA-RAR 

NRIF3-B42 

0.1 

0.1 

1 

LexA-RXR 

NRIF3-B42 

0.2 

63 

315 

LexA-GR 

NRIF3-B42 

0.8 

0.6 

0.8 

«  The  LacZ  reporter  activities  were  determined  for  yeast  strains  harboring  the 
indicated  bait  and  prey  plasmids  in  the  presence  or  absence  of  cognate  ligands  as 
described  in  Materials  and  Methods.  See  the  text  for  detailed  explanations. 


the  molecular  basis  for  the  broad  spectrum  of  receptor  binding 
by  coactivators  such  as  SRC-1  and  GRIP1.  Since  a  putative 
LXXLL  motif  is  also  present  in  NRIF3  (amino  acids  9  to  13), 
we  asked  whether  NRIF3  also  interacts  with  the  LBDs  of  other 
nuclear  receptors. 

The  LBDs  of  several  nuclear  receptors  were  examined  for 
interaction  with  NRIF3  in  a  yeast  two-hybrid  assay.  As  shown 
in  Table  1,  NRIF3  does  not  interact  with  LexA  alone  (negative 
control)  with  or  without  ligand.  LexA-TR  and  LexA-RXR 
showed  little  (if  any)  interaction  with  NRIF3  in  the  absence  of 
their  cognate  ligands.  However,  the  presence  of  T3  (for  TR)  or 
9-cis  RA  (for  RXR)  resulted  in  a  strong  stimulation  of  their 
interaction  with  NRIF3,  as  indicated  by  the  induction  of  (3-ga- 
lactosidase  activity  (Table  1).  Interestingly,  when  LexA-RAR 
or  LexA-GR  was  used  as  the  bait,  no  interaction  was  detected 
with  NRIF3  in  the  presence  or  absence  of  their  cognate  ligands 
(Table  1).  The  finding  that  NRIF3  interacts  with  TR  but  not 
RAR  was  surprising  in  light  of  a  recent  study  which  showed 
that  TR  and  RAR  functionally  interact  with  the  same  LXXLL 
boxes  (boxes  2  and  3)  of  SRC-l/NCoA-1  (52).  As  positive 
controls,  we  confirmed  that  both  LexA-RAR  and  LexA-GR 
exhibited  ligand-dependent  interaction  with  other  coactivators 
that  are  not  receptor  specific  (data  not  shown).  Taken  to¬ 
gether,  these  results  suggest  that  NRIF3  exhibits  differential 
specificities  in  its  interactions  with  different  nuclear  receptors. 

NRIF3  specifically  binds  to  TR  and  RXR  but  not  to  other 
nuclear  receptors  in  vitro.  To  further  examine  the  interaction 
between  NRIF3  and  various  nuclear  receptors  as  well  as  to 
confirm  the  potential  receptor  specificity  of  NRIF3,  in  vitro 
GST  binding  assays  were  performed  (30).  35S-labeled  nuclear 
receptor,  generated  by  in  vitro  transcription  and  translation, 
was  incubated  with  purified  GST-NR1F3  or  the  GST  control 
bound  to  glutathione-agarose  beads.  All  binding  assays  were 
carried  out  with  or  without  the  cognate  ligand  of  the  examined 
receptor.  As  shown  in  Fig.  4  (top  left),  TR  and  NRIF3  interact 
poorly  in  the  absence  of  T3.  Addition  of  T3  resulted  in  a  strong 
increase  in  TR  binding  to  GST-NRIF3,  confirming  that  NRIF3 
associates  with  TR  in  a  T3-dependent  manner.  Using  similar 
binding  assays,  we  also  studied  the  interaction  of  NRIF3  with 
six  other  nuclear  receptors.  Consistent  with  our  findings  from 
the  yeast  two-hybrid  experiments  (Table  1),  NRIF3  interacted 
with  RXR  in  vitro  in  a  ligand-dependent  manner  (Fig.  4)  but 
showed  little  or  no  binding  to  other  nuclear  receptors  (RAR, 
VDR,  GR,  PR,  and  ER)  in  the  presence  or  absence  of  their 
cognate  ligands  (Fig.  4).  Taken  together,  the  results  of  the 
yeast  two-hybrid  (Table  1)  and  the  in  vitro  binding  (Fig.  4) 
assays  suggest  that  NRIF3  possesses  a  distinct  receptor  speci¬ 
ficity. 

NR1F3  selectively  potentiates  TR-  and  RXR-mediated  trans¬ 
activation  in  vivo.  To  examine  the  potential  role  of  NRIF3  in 
TR-mediated  transactivation,  transfection  studies  were  carried 
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or  hER)  was  incubated  with  an  affinity-purified  GST  control  Aft  incubation  and  washing  the  bound  receptors  were  analyzed  by  sodium  dodecyl 

used  in  each  incubation.  GST-RXR  was  used  as  a  positive  control  for  RAR  bindtng. 


out.  HeLa  cells,  which  lack  endogenous  TR  (25),  were  trans¬ 
fected  with  a  vector  expressing  TR  and  a  CAT  reporter  under 
the  control  of  the  AMTV  basal  promoter  linked  to  an  idealized 
IR  (AGGTCATGACCT)  TRE  sequence  (IR-AMTV-CAT) 
(25),  along  with  either  a  control  plasmid  or  a  vector  expressing 
NRIF3.  As  shown  in  Fig.  5A,  NRIF3  significantly  enhances 
TR-mediated  activation  of  the  CAT  reporter  (typically  2.5-  to 
3-fold).  As  a  control,  we  also  examined  the  effect  of  CBP,  a 
reported  coactivator  for  nuclear  receptors  (13,  37),  and  found 
that  its  expression  results  in  a  degree  of  enhancement  similar 
to  that  with  NRIF3  (around  threefold)  (Fig.  5A). 

We  also  examined  another  CAT  reporter  controlled  by  the 
herpesvirus  tk  promoter  linked  to  native  rat  growth  hormone 
TRE  sequences  (5).  NRIF3  was  found  to  also  enhance  TR- 
mediated  activation  of  this  reporter  (about  3.5-fold)  (Fig.  5B). 


In  addition,  using  similar  transfection  assays,  we  found  that 
NRIF3  enhances  TR-mediated  activation  of  two  other  report¬ 
ers,  (IR)2-TATA-CAT  and  DR4-AMTV-CAT  (data  not  shown). 
Therefore,  NRIF3  potentiates  TR-mediated  transactivation  in 
a  variety  of  different  TRE  and  promoter  contexts.  Taken  to¬ 
gether,  the  results  of  these  transfection  studies  suggest  that 
NRIF3  can  function  as  a  coactivator  of  TR. 

To  examine  whether  NRIF3  can  also  act  as  a  coactivator  for 
RXR,  HeLa  cells  were  transfected  with  the  IR-AMTV-CAT 
reporter,  whose  IR  sequence  can  also  function  as  a  strong 
response  element  for  the  RXR(s)  and  RAR(s)  (25,  49,  61). 
HeLa  cells  express  the  endogenous  RXR(s)  and  RAR(s),  as 
the  activity  of  the  IR-AMTV-CAT  reporter  was  strongly  stim¬ 
ulated  by  their  cognate  ligands,  even  without  cotransfection  of 
any  receptor  expression  plasmid  (Fig.  6A,  bars  1,  3,  and  5). 


CBP  ▼  ,  . 

FIO.  5.  NRIF3  enhances  TR-mcdialctl  itansaclivalion  in  vivo.  HeLa  cells  were  iransfccted  with  !|^0Lf*p^^"^I^“ing‘,NlJj|F3<'0flhlTmpty  control  venire 

»"•- -  «*»  -  «•*- » ■“  - NRIF!' 0H'  “ 
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a 

1  IR  I - 1  AMTV-CaT| 


B 

I  IR  1 - 1  flMTV-CAT  j 


C 

I  DR1  1 - 1  AMTV-CAT~1 


NRIF3 
9-cis  R  A 
LG100153 
TTNPB 


.  + 


-  + 


RXR  +  +  +  +  RXR  +  + 

NRIF3  +  +  NR,F3  + 

9-cisRA  -  +  -  +  9-cisRA  -  +  -  + 

LG100153  -  +  *  + 

FIG.  6.  NRIF3  functions  as  a  coactivator  for  RXR  but  not  RAR.  (A)  NRIF3  potentiates  the  activity  of  the  endogenous  RXR(s)  but  _not  the ;  RAR(s).  HeLa  cells 
were  transfected  with  the  IR-AMTV-CAT  reporter  (without  any  receptor  expression  vector)  to  examine  the  activation  by  endo8en°u”e,in°'d  r^P‘°rp 
expression  vector  or  the  empty  control  vector  was  cotransfected  to  examine  the  effect  of  NRIF3  on  the  activity  of  the  endogenous  RXR(s)  or  RAR(s).  Relative  CAT 
activity  was  determined  in  the  presence  (filled  bars)  or  absence  (hatched  bars)  of  the  indicated  ligands  (1  p-M).  (B  and  C)  NRIF3  potentiates  the  activity  of  the 
exogenously  expressed  RXR.  A  vector  expressing  hRXRa  was  cotransfected  into  HeLa  cells  with  the  IR-AMTV-CAT  reporter  B)  or  the  DR1-AMTV-CAT  reporter 
(C)Tn  the  presence  (filled  bars)  or  absence  (hatched  bars)  of  the  indicated  ligands  (1  p.M).  The  effect  of  NRIF3  on  RXR-med,ated  transact, vat.on  was  examined  as 
described  for  panel  A.  TTNPB,  a  synthetic  ligand  for  ILAR. 


Cotransfection  of  NRIF3  enhanced  the  activation  of  this  re¬ 
porter  by  either  9-cis  RA,  or  LG100153  (72),  an  RXR-specific 
ligand  (Fig.  6A,  bars  1  and  2  and  bars  3  and  4).  In  contrast, 
although  the  RAR-specific  ligand  TTNPB  (68)  also  activated 
the  IR-AMTV-CAT  reporter,  cotransfection  of  NRIF3  had 
no  effect  (Fig.  6A,  bars  5  and  6).  These  results  indicate  that 
NRIF3  potentiates  the  activity  of  the  endogenous  RXR(s)  but 
not  the  RAR(s),  which  is  consistent  with  the  distinct  receptor 
specificity  of  NRIF3  revealed  from  the  yeast  two-hybrid  assay 
(Table  1)  and  in  vitro  binding  experiments  (Fig.  4). 

To  further  document  that  NRIF3  can  function  as  a  coacti¬ 
vator  for  RXR,  a  vector  expressing  exogenous  RXR  was  co¬ 
transfected  with  IR-AMTV-CAT.  Exogenous  RXR  expression 
enhanced  the  activation  of  this  CAT  reporter  by  either  9-cis 
RA  or  LG100153  (compare  Fig.  6A  and  B,  bars  1  and  3).  This 
RXR-mediated  activation  of  reporter  expression  was  further 
stimulated  by  NRIF3  (Fig.  6B).  Finally,  we  also  examined  the 
activation  of  a  DR1-AMTV-CAT  reporter.  This  DR1  (AGGT 
CANAGGTCA  [where  N  is  any  nucleotide])  sequence  is 
thought  to  be  a  specific  response  element  for  RXR  (39,  51). 
Although  we  found  that  this  DR1  is  a  weaker  response  ele¬ 
ment  than  the  IR  sequence,  cotransfection  of  an  RXR  expres¬ 
sion  vector  led  to  ligand-induced  activation  of  this  DR1  re¬ 
porter,  which  was  also  further  enhanced  by  NRIF3  (Fig.  6C). 

NRIF3  does  not  potentiate  the  activities  of  GR,  PR,  ER,  and 
VDR  in  vivo.  The  selective  coactivation  of  TR  and  RXR  (but 
not  RAR)  by  NRIF3  is  consistent  with  its  distinct  binding 
specificities  to  these  receptors.  To  further  establish  that  NRIF3 
acts  as  a  receptor-specific  coactivator,  we  next  examined  the 
effect  of  NRIF3  on  the  activities  of  four  additional  nuclear 
receptors,  including  GR,  PR,  ER,  and  VDR,  by  transfection 
experiments.  HeLa  cells  were  transfected  with  a  GRE/PRE- 
tk-CAT  reporter  along  with  a  vector  expressing  either  GR  or 
PR.  As  shown  in  Fig.  7A,  cognate  hormone  treatment  results 
in  activation  of  the  CAT  reporter.  However,  expression  of 
NRIF3  has  little  effect  (Fig.  7A).  Similar  experiments  were 
carried  out  with  ER  and  ERE-AMTV-CAT  or  VDR  and 
VDRE-AMTV-CAT.  As  shown  in  Fig.  7B  and  C,  NRIF3  was 


found  to  have  little  or  no  effect  on  the  activities  of  these 
receptors  as  well.  Taken  together,  the  combined  results  of  our 
transfection  studies  support  the  notion  that  NRIF3  is  a  coac¬ 
tivator  with  a  unique  receptor  specificity. 

A  novel  C-terminal  domain  in  NRIF3  is  essential  for  ligand- 
dependent  interactions  with  TR  and  RXR.  The  LXXLL  sig¬ 
nature  motif  has  been  found  to  be  present  in  the  receptor- 
interacting  domains  of  many  identified  coactivators,  such  as 
SRC-l/NCoA-1  and  GRIP1/TIF-2  (32).  The  broad  spectrum 
of  receptor  binding  by  coactivators  such  as  SRC-1  suggests  that 
the  LXXLL-containing  interacting  domain  may  recognize 
structurally  similar  surfaces  of  these  LBDs.  Indeed,  recent 
structural  and  functional  studies  revealed  that  the  LXXLL 
motif  and  its  nearby  flanking  amino  acids  are  involved  in  direct 
contact  with  a  hydrophobic  cleft  of  the  target  surfaces  pre¬ 
sented  by  the  ligand-bound  LBDs  of  nuclear  receptors  (19,  23, 
52,  56).  The  facts  that  NRIF3  also  contains  an  LXXLL  motif 
(amino  acids  9  to  13)  (Fig.  2  and  8A)  and  exhibits  a  distinct 
receptor  specificity  raise  the  possibility  that  (i)  the  motif  and 
surrounding  amino  acids  are  involved  in  mediating  receptor- 
specific  interaction  of  NRIF3  or  (ii)  another  region  of  NRIF3 
(alone  or  in  concert  with  the  LXXLL  motif  region)  plays  an 
important  role  in  mediating  such  an  interaction. 

To  explore  these  issues,  we  examined  whether  EnS  and  EnL, 
which  contain  the  same  LXXLL  motif  and  flanking  amino 
acids  as  NRIF3,  can  interact  with  nuclear  receptors  in  a  yeast 
two-hybrid  assay  (Fig.  8).  EnS  consists  of  111  amino  acids  and 
is  100%  identical  to  the  first  111  residues  of  NRIF3,  while  EnL 
consists  of  170  amino  acids,  the  first  161  of  which  are  also 
100%  identical  to  the  same  region  in  NRIF3  (Fig.  2  legend  and 
Fig.  8A).  Thus,  NRIF3  and  EnL  differ  only  in  their  C  termini, 
with  NRIF3  having  a  unique  region  of  16  amino  acids  and  EnL 
having  a  unique  region  of  9  amino  acids  (Fig.  8A).  Interest¬ 
ingly,  despite  their  extensive  identity  with  NRIF3,  the  interac¬ 
tion  with  liganded  TR  or  RXR  is  completely  abolished  in  EnS 
and  EnL  (Fig.  8B).  We  also  examined  other  nuclear  receptors 
that  do  not  interact  with  NRIF3  and  found  that  they  also  do 
not  interact  with  EnS  or  EnL  (data  not  shown).  These  results 


Vol.  19,  1999 


CO  ACTIVATOR  AND  RECEPTOR  SPECIFICITY  7197 


NHIrJ  +  . 

FIG.  7.  NRIF3  does  not  potentiate  the  activity  ^R' (B)tra“df^RE-4MW-CAT  and^WR^C)-  Cells  were  incubated  in  the 

^Lr:nl°%lCd”ar?)Ror^^eth^h“dd  Kof  100  1J  dexame.hasone  for  OR,  progesterone  for  PR,  estradiol  for  ER,  and  l,25-(OH)2-vitami»  D3  for  VDR. 
Cotransfection  of  NRIF3  was  found  to  have  little  effect  on  the  activities  of  these  receptors. 


indicate  that  the  unique  C-terminal  domain  in  NRIF3  (NCD) 
(residues  162  to  177)  is  essential  for  its  specific  interaction  with 
liganded  TR  and  RXR  while  the  N-terminal  LXXLL  motif 
(amino  acids  9  to  13)  and  its  flanking  sequences  are  not  suffi- 
cient  to  allow  for  detectable  receptor  interactions. 

Although  the  LXXLL  motif  was  found  to  be  insufficient  for 
interaction,  we  examined  whether  this  N-terminal  motif  of 
NRIF3  contributes  to  the  NRIF3-receptor  interaction  by 
mutating  the  first  leucine  of  the  LXXLL  motif  into  alanine 
(L9A)  by  site-directed  mutagenesis.  Previous  experiments 
have  shown  that  the  three  leucine  residues  are  essential  for  an 
LXXLL  module  to  interact  with  receptor  LBDs  and  that  the 
replacement  of  any  of  them  with  alanine  abolishes  the  inter¬ 
action  (32).  We  examined  the  L9A  NRIF3  mutant  form  for  its 
interaction  with  TR  and  RXR  in  a  yeast  two-hybrid  assay.  As 


shown  in  Fig.  9,  the  L9A  mutant  was  still  capable  of  ligand- 
dependent  interaction  with  TR  and  RXR  (~25-fold  induction 
by  ligand).  However,  the  introduced  mutation  reduced  the 
interaction  by  about  4-fold  (for  TR)  or  14-fold  (for  RXR). 
These  results  suggest  that  although  the  LXXLL  motif  is  not 
absolutely  essential  for  NRIF3  interaction  with  liganded  re¬ 
ceptors,  it  plays  a  role  in  allowing  an  optimum  interaction  to 


occur. 

Computer  modeling  suggests  that  the  NCD  docks  into  the 
hydrophobic  cleft  of  the  liganded  LBDs.  Secondary-structure 
analysis  of  the  C-terminal  domain  of  NRIF3  predicted  the 
formation  of  an  a-helix.  Moreover,  inspection  of  the  putative 
C-terminal  helix  revealed  an  LXXIL  motif  (amino  acids  172  to 
176),  which  is  reminiscent  of  the  canonical  LXXLL.  Alttoug 
the  ultimate  elucidation  of  the  molecular  basis  of  the  NKlr> 


A 
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FIG  8.  The  NCD  is  essentia!  for  the  interaction  with  liganded  TR  or  RXR.  (A)  Schematic 

two-hybrid  assay  as  de^rLe^n^atedauTnd'Mwhods  The  assaysUe  performed  in  the  absence  (hatched  bars)  or  the  presence  (filled  bars)  of  H* 
or  9 -or  RA  (for  RXR). 


7198  LI  ET  AL. 


Mol.  Cell.  Biol. 


Baits  TR  TR  RXR  RXR 

Preys  WT  L9A  WT  L9A 

FIG.  9.  The  LXXLL  motif  of  NRIF3  is  required  for  optimum  interaction 
with  TR  and  RXR.  Wild-type  NRIF3  (WT)  or  the  L9A  NRIF3  mutant  (L9A) 
was  examined  for  interaction  with  LexA-TR  or  LexA-RXR  in  a  yeast  two-hybrid 
assay  as  described  in  Materials  and  Methods.  p-Galactosidase  activities  were 
determined  in  the  absence  (filled  bars)  or  presence  (stippled  bars)  of  cognate 
ligands  (1  [xM  T3  for  TR;  1  p,M  9-cis  RA  for  RXR). 


receptor  interaction  awaits  future  studies  such  as  X-ray  crys¬ 
tallography,  the  putative  helix  structure  of  the  NCD  and  its 
LXXIL  motif  suggest  that  the  NCD  may  interact  with  the 
liganded  LBDs  in  a  fashion  similar  to  that  of  the  receptor¬ 
interacting  domains  that  employ  the  canonical  LXXLL  mo¬ 
tifs).  To  explore  this  possibility,  we  modeled  the  interaction  of 
the  C  terminus  of  NRIF3  with  the  liganded  LBDs,  using  algo¬ 
rithms  developed  mainly  by  the  staff  of  the  laboratory  of  one  of 
the  authors  (R.  Abagyan  and  coworkers)  (1,  63,  70,  74,  75). 
The  background  information  and  procedures  used  for  con¬ 
structing  these  models  are  described  in  Materials  and  Meth¬ 
ods.  The  results  of  our  modeling  suggest  that  the  NCD  fits  well 
into  the  hydrophobic  cleft  formed  on  the  LBDs  as  a  result  of 
ligand  binding.  An  example  of  such  a  model  (NCD-TR  LBD) 
is  shown  in  Fig.  10.  In  this  model,  the  two  leucines  and  one 
isoleucine  of  the  LXXIL  motif  are  predicted  to  be  deeply 
buried  in  the  central  cavity  of  the  hydrophobic  groove  formed 
by  the  liganded  LBD  of  the  receptor.  We  also  calculated  the 
putative  binding  energy  for  the  modeled  NCD-TR  complex, 
using  an  improved  partitioning  binding  energy  function,  with 
continuum  representation  of  the  electrostatics  of  the  system 
(64).  The  calculated  binding  energy  for  the  modeled  NCD-TR 
complex  is  about  -21  kcal/mol.  As  a  control,  we  carried  out  a 
similar  modeling  procedure  using  the  second  LXXLL  box 
within  the  receptor-interacting  domain  of  SRC-1.  This  LXXLL 
box  has  been  shown  to  be  required  for  interaction  with  TR 
(52).  Our  calculated  binding  energy  for  this  LXXLL  box  with 
liganded  TR  LBD  is  -18  kcal/mol,  a  value  that  is  very  close  to 
the  one  calculated  for  the  NCD.  Altogether,  our  modeling  and 
calculations  suggest  a  mechanism  in  which  the  NCD  directly 
mediates  interaction  with  liganded  LBDs  through  an  LXXIL 
motif. 

Functional  interaction  of  the  NCD  with  liganded  LBDs  and 
the  essential  role  of  its  LXXIL  motif.  To  explore  the  possibility 
suggested  from  our  computer  modeling,  the  NCD  (amino  acids 
162  to  177)  was  fused  to  the  LexA  DNA  binding  domain  and 
was  examined  for  interaction  with  the  receptor  LBDs  in  a  yeast 
two-hybrid  assay.  The  LexA-NCD  fusion  protein  alone  does 
not  activate  the  LacZ  reporter  in  yeast  (data  not  shown).  As  a 
negative  control,  we  also  found  that  LexA-NCD  does  not  in¬ 
teract  with  the  B42  activation  domain  itself  (Fig.  11)  and  that 
LexA  alone  does  not  interact  with  the  receptor  LBDs  (data  not 


FIG.  10.  Hypothetical  model  of  the  interaction  of  the  NCD  and  the  liganded 
LBD.  The  docking  of  the  C-terminal  helix  of  NRIF3,  which  contains  an  LXXIL 
module,  to  the  ligand-bound  LBDs  was  carried  out  as  described  in  Materials  and 
Methods.  The  NCD-TR  LBD  model  is  shown  here  as  an  example.  The  side 
chains  of  the  two  leucines  (green)  and  one  isoleucine  (cyan)  of  the  LXXIL  core 
fit  within  a  hydrophobic  groove  (salmon)  on  the  surface  of  the  liganded  LBD 
(80).  A  similar  modeling  procedure  was  carried  out  with  an  LXXLL  box  of 
SRC-1  (result  not  shown).  Putative  binding  energies  (-21  kcal/mol  for  the  NCD 
and  -18  kcal/mol  for  the  LXXLL  box  of  SRC-1)  were  calculated  as  described  in 
Materials  and  Methods.  See  the  text  for  details. 


shown).  However,  when  the  LexA-NCD  and  the  LBD  of  TR  or 
RXR  (fused  with  B42)  were  used  in  the  two-hybrid  assay,  a 
strong  ligand-dependent  interaction  was  observed,  as  indicated 
by  the  induction  of  p-galactosidase  activity  by  their  cognate 
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Baits  WT  WT  WT  WT  Mut  Mut  Mut 

Preys  B42  TR  RXR  RAR  TR  RXR  RAR 

FIG.  11.  Interaction  of  the  NCD  with  the  receptor  LBDs  and  the  role  of  the 
LXXIL  motif.  The  wild-type  NCD  (WT)  or  the  NCD  mutant  form  (Mut)  in 
which  the  three  core  hydrophobic  residues  of  the  LXXIL  motif  (two  leucines  and 
one  isoleucine)  are  changed  into  alanines  was  examined  for  interaction  with  the 
LBDs  of  TR,  RXR,  and  RAR  in  a  yeast  two-hybrid  assay  as  described  in 
Materials  and  Methods.  (3-GaIactosidase  activities  were  determined  in  the  ab¬ 
sence  (open  bars)  or  presence  (stippled  bars)  of  cognate  ligands.  The  prey 
expressing  B42  alone  was  used  as  a  negative  control. 
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ligands  (Fig.  11).  These  results  suggest  that  the  NCD  can 
directly  interact  with  the  LBDs  of  TR  and  RXR  in  a  ligand- 
dependent  manner. 

Since  NRIF3  harbors  a  distinct  receptor  specificity  in  inter¬ 
acting  only  with  TR  and  RXR  and  not  other  receptors  (e.g., 
RAR),  we  next  asked  whether  the  NCD  also  harbors  a  recep¬ 
tor  specificity.  To  our  surprise,  the  NCD  was  found  to  interact 
efficiently  with  the  LBD  of  RAR  in  a  ligand-dependent  manner 
(Fig.  11).  Therefore,  while  our  results  clearly  suggest  that  the 
NCD  is  an  important  surface  for  receptor  interactions,  as  the 
NCD  is  found  to  be  both  essential  for  (Fig.  8)  and  sufficient  to 
mediate  (Fig.  11)  such  interactions,  it  nevertheless  does  not 
appear  to  be  (solely)  responsible  for  the  receptor  specificity  of 
NRIF3.  It  is  possible  that  another  region  of  the  NRIF3  mole¬ 
cule  contributes  to  the  observed  receptor  specificity  of  NRIF3 
and/or  that  the  specificity  is  determined  by  the  overall  three- 
dimensional  structure  of  NRIF3. 

Since  our  model  predicts  the  importance  of  the  LXXIL 
motif  in  the  NCD-receptor  interaction  (Fig.  10),  we  tested  this 
by  changing  the  three  core  residues  of  the  motif  (two  leucines 
and  one  isoleucine)  into  alanines.  As  expected,  interaction 
with  the  LBDs  is  completely  abolished  in  the  resulting  mutant 
NCD  (Fig.  11),  confirming  that  the  LXXIL  motif  is  essential 
for  the  interaction. 

DISCUSSION 

Recent  efforts  in  understanding  receptor-mediated  tran¬ 
scription  have  led  to  the  identification  of  a  number  of  co¬ 
activators  for  nuclear  hormone  receptors,  which  can  be  cat¬ 
egorized  into  two  main  groups  based  on  overall  homology, 
the  SRC-1  family  (including  SRC-l/NCoA-1,  TIF2/GRIP1/ 
NCoA-2,  and  AIBl/p/CIP/ACTR/RAC3/TRAM-l)  (2,  14,  34, 
35,  37,  44,  58,  73,  74,  79)  and  the  CBP/p300  family  (13,  31,  37). 
Other  putative  coactivators  (e.g.,  ARA70  and  PGC-1)  that  do 
not  belong  to  the  SRC-1  or  CBP/p300  family  have  also  been 
identified  (60.  85).  In  addition,  p/CAF  may  also  be  involved  m 
receptor  action  through  its  association  with  nuclear  receptors 
as  well  as  with  other  coactivators  (11,  14,  38,  83).  Among  these 
known  coactivators,  CBP/p300,  members  of  the  SRC-1  group, 
and  p/CAF  all  possess  histone  acetyltransferase  activities  (8, 
14,  57,  69,  83). 

In  this  study  we  report  the  identification  of  a  novel  nuclear 
protein  (NRIF3)  which  exhibits  specific  ligand-dependent  in¬ 
teractions  with  TR  and  RXR  but  not  with  RAR,  VDR,  GR, 
PR,  or  ER.  Functional  experiments  indicated  that  NR1F3  po¬ 
tentiates  TR-  and  RXR-mediated  transactivation  in  vivo  but 
exhibits  little  or  no  effect  on  the  activities  of  other  examined 
receptors.  Therefore,  NRIF3  represents  a  novel  coactivator 
with  a  distinct  receptor  specificity  and,  thus,  may  shed  light  on 
clarifying  the  molecular  mechanism(s)  underlying  receptor- 
specific  regulation  of  gene  expression. 

A  database  search  indicated  that  NRIF3  has  no  homology 
with  any  known  coactivators  except  in  a  single  LXXLL  motif. 
An  unusual  feature  of  NRIF3  is  its  relatively  small  size,  which 
is  in  sharp  contrast  to  the  sizes  of  SRC-1  and  CBP/p300.  A 
homology  search  identified  two  alternatively  spliced  isoforms 
of  NRIF3  which  were  previously  designated  (33-endonexin 
short  and  long  forms  (67).  Preliminary  studies  with  these  two 
endonexins  indicate  that,  like  NR1F3,  they  localize  to  the  cell 
nucleus  (43a.  66a).  Interestingly,  despite  their  extensive  iden¬ 
tities  with  NRIF3,  both  EnS  and  EnL  fail  to  exhibit  interaction 
with  liganded  nuclear  receptors  (Fig.  8).  Consistent  with  this 
finding,  we  found  that  EnS  and  EnL  have  little  effect  on  re¬ 
ceptor-mediated  transcription  in  transfection  experiments 
(data  not  shown).  Therefore,  the  precise  roles  of  these  two 


endonexins  remain  to  be  elucidated.  We  suggest  two  not  mu¬ 
tually  exclusive  possibilities.  First,  since  both  EnL  and  EnS 
appear  to  localize  to  the  nucleus,  it  is  possible  that  they  act  as 
cofactors  for  other  transcriptional  regulators.  Second,  since  the 
EnS  can  interact  with  the  cytoplasmic  tail  of  (33-integrin  (22, 
67),  it  may  communicate  signals  generated  at  the  plasma  mem¬ 
brane  to  the  cell  nucleus.  An  example  of  a  protein  which  is 
involved  in  both  cell  adhesion  and  transcriptional  regulation  is 


p-catenin  (82). 

Previous  study  of  the  endonexins  identified  the  presence  of 
NRIF3-related  mRNAs  (by  Northern  blotting)  in  a  wide  range 
of  human  tissues  (67).  Because  NRIF3  and  EnL  contain  almost 
identical  nucleotide  sequences  and  differ  only  by  an  alternative 
splice  site  which  results  in  the  removal  of  a  small  exon  in 
NRIF3,  it  is  difficult  to  specifically  identify  NRIF3  mRNA  by 
Northern  blotting.  A  search  of  the  expressed  sequence  tag 
database  indicates  that  NRIF3  mRNA,  as  well  as  both  EnL 
and  EnS  mRNAs,  is  expressed.  However,  the  precise  determi¬ 
nation  of  cell  and  tissue  distribution  of  NRIF3,  EnS,  and  EnL 
will  require  the  development  of  highly  selective  antibodies. 
Nevertheless,  the  wide  expression  pattern  of  NRIF3-related 
mRNAs  is  consistent  with  the  role  of  NRIF3  as  a  coactivator  of 


the  TRs,  which  are  also  widely  expressed  (70),  or  the  RXRs, 
which  are  ubiquitously  expressed  (48). 

A  key  goal  concerning  the  action  of  nuclear  hormone  recep¬ 
tors  is  to  understand  the  molecular  events  underlying  the  func¬ 
tional  specificities  of  different  receptors  in  regulating  the  ex¬ 
pression  of  their  target  genes.  Determinants  of  specificity 
include  specific  ligand  binding  and  selective  binding  of  the 
receptors  to  their  cognate  response  elements,  as  well  as  specific 
expression  patterns  of  different  receptors.  These  determinants 
alone,  however,  are  not  always  sufficient  to  explain  the  extents 
of  specificity  observed  for  members  of  the  nuclear  receptor 
family.  For  example,  several  members  of  the  thyroid  hormone/ 
retinoid  receptor  subfamily  may  bind  similarly  to  common 
DNA  elements  while  target  genes  containing  those  elements 
are  only  selectively  activated  by  certain  receptors  (20,  47). 
Therefore,  it  is  likely  that  additional  factors  (determined  by 
cell  and  promoter  contexts)  are  involved  in  determining  recep¬ 
tor  functional  specificity.  In  this  respect,  most  known  coacti¬ 
vators  do  not  appear  to  be  receptor  specific.  For  example, 
members  of  the  SRC-1  and  CBP/p300  families  interact  with 
and  appear  to  be  involved  in  the  actions  of  many  nuclear 
receptors  (13,  14,  34,  37).  Two  known  coactivators  that  may  be 
involved  in  receptor-specific  functions  are  ARA70  and  PGC-1. 
The  AR  coactivator  ARA70  has  been  reported  to  potentiate 
the  activity  of  AR  more  efficiently  than  it  does  the  activities  of 
other  nuclear  receptors  (85).  However,  whether  ARA70  can 
associate  with  other  receptors  remains  to  be  thoroughly  exam¬ 
ined.  The  expression  of  PGC-1  is  restricted  mainly  to  the 
brown  fat  tissue  and  is  thought  to  be  directly  involved  in  the 
regulation  of  thermogenesis  by  PPARy  (60).  Nevertheless, 
PGC-1  exhibits  a  relatively  broad  spectrum  of  binding  to  dif¬ 
ferent  nuclear  receptors.  Therefore,  the  identification  of 
NRIF3  represents  the  first  example  of  a  coactivator  with  such 
a  clearly  defined  receptor  specificity. 

The  receptor  specificity  of  NRIF3  raises  an  interesting  ques¬ 
tion  about  its  molecular  mechanism.  Domain  analysis  suggests 
that  the  LXXLL  motif  (amino  acids  9  to  13)  and  its  flanking 
sequences  in  NRIF3  are  not  sufficient  for  interaction  with 
liganded  nuclear  receptors.  In  fact,  such  interaction  is  com¬ 
pletely  abolished  in  EnL,  an  alternatively  spliced  product 
which  has  the  same  LXXLL  motif  and  contains  the  first  161 
amino  acids  (of  a  total  of  177  amino  acids)  of  NRIF3.  I  his 
result  suggests  that  a  putative  domain  consisting  of  the  Last  i 
amino  acids  of  NRIF3  (residues  162  to  177)  is  essential  for  its 
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interaction  with  liganded  receptors.  Inspection  of  this  NCD 
indicates  that  it  contains  an  LXXIL  motif  (amino  acids  172  to 
176),  and  secondary-structure  analysis  predicts  the  formation 
of  an  ot-helix.  The  predicted  helix  structure  and  the  similarity 
of  LXXIL  to  the  canonical  LXXLL  raise  the  possibility  that 
this  LXXIL-containing  region  plays  a  direct  role  in  NRIF3- 
receptor  interactions. 

Our  modeling  of  the  NCD-LBD  interaction  (Fig.  10)  sug¬ 
gests  that  the  same  hydrophobic  groove  in  the  ligand-bound 
LBD,  which  has  been  shown  by  previous  studies  to  be  the 
binding  site  for  coactivators  such  as  SRC-l/NCoA-1  and 
GRIP1  (19,  23,  56),  may  also  be  a  suitable  site  for  the  docking 
of  the  C-terminal  helix  of  NRIF3.  Thus,  the  utilization  of  the 
complementary  pair  of  an  a-helix  (in  the  coactivator)  and  a 
hydrophobic  groove  (in  the  receptor)  for  interaction  seems  to 
be  a  general  scheme  that  also  applies  to  NRIF3.  The  binding 
energy  estimated  for  the  NCD  and  the  TR  LBD  (-21  kcal/ 
mol)  is  similar  to  the  value  calculated  for  the  second  LXXLL 
box  of  SRC-l/NCoA-1  and  the  TR  LBD  (-18  kcal/mol).  To 
explore  the  mechanisms  suggested  by  the  modeling,  we  found 
that  the  NCD  can  directly  mediate  interaction  with  the  LBDs 
in  a  ligand-dependent  manner  (Fig.  11).  Moreover,  the  LXXIL 
motif  contained  in  the  NCD  was  found  to  be  essential  for  such 
interactions  (Fig.  11).  In  summary,  the  results  of  a  combination 
of  a  computer  modeling  and  domain  and  mutagenesis  analyses 
clearly  suggest  that  the  NCD  is  an  important  surface  that  is 
directly  involved  in  interaction  with  the  LBDs  of  the  receptors, 
where  the  LXXIL  motif  of  the  NCD  mimics  the  function  of  a 
canonical  LXXLL.  The  AF-2  helix  (which  is  a  critical  constit¬ 
uent  of  the  hydrophobic  groove  formed  upon  ligand  binding) 
of  the  LBD  has  been  shown  to  be  important  for  interaction 
with  the  LXXLL  boxes  of  the  coactivators  (23).  Interestingly, 
we  have  examined  two  TR  AF-2  mutants  (66)  and  found  that 
in  both  cases,  ligand-dependent  interaction  with  NRIF3  was 
abolished  (43a). 

However,  the  NCD  alone  does  not  appear  to  harbor  the 
same  specificity  as  NRIF3  (Fig.  11).  Thus,  it  seems  likely  that 
another  part  of  the  NRIF3  molecule  contributes  to  the  ob¬ 
served  specificity  and/or  that  the  specificity  is  determined  by 
the  overall  three-dimensional  structure  of  NRIF3.  In  this  re¬ 
gard,  the  potential  role  of  the  N-terminal  LXXLL  motif  is 
intriguing.  Although  the  N-terminal  LXXLL  motif  (amino  ac¬ 
ids  9  to  13)  is  insufficient  alone  to  mediate  an  interaction  with 
TR  or  RXR  (Fig.  8),  it  can  influence  the  interaction  of  NRIF3 
with  these  receptors,  as  the  L9A  NRIF3  mutant  retains  a 
significant  but  nevertheless  reduced  level  of  association  with 
liganded  TR  or  RXR  (Fig.  9).  Thus,  NRIF3  appears  to  employ 
at  least  two  regions  in  interacting  with  liganded  TR  or  RXR, 
with  the  NCD  playing  an  essential  role  and  the  N-terminal 
LXXLL  motif  playing  a  secondary  role.  A  simplified  explana¬ 
tion  for  these  findings  is  that  the  NCD  provides  a  major  surface 
for  receptor  binding  while  the  N-terminal  LXXLL  motif  makes 
some  minor  contact  with  either  the  same  receptor  molecule  or, 
more  likely,  with  the  other  partner  of  a  homodimer  or  het¬ 
erodimer  to  further  stabilize  the  NRIF3-receptor  interaction. 
An  example  of  a  coactivator  molecule  employing  two  separate 
regions  to  interact  with  the  two  partners  of  a  receptor  dimer 
has  been  documented  for  the  recently  solved  crystal  structure 
of  liganded  PPARy  complexed  with  SRC-l/NCoA-1  (56).  If 
NRIF3  indeed  employs  both  its  NCD  and  its  N-terminal 
LXXLL  motif  in  receptor  interactions,  the  specificity  may  re¬ 
sult  from  the  intramolecular  dialog  between  the  two  regions  as 
well  as  the  intermolecular  dialog  among  NRIF3  and  the  re¬ 
ceptors.  However,  it  remains  possible  that  the  N-terminal 
LXXLL  plays  only  a  more  indirect  role  and  that  the  overall 


three-dimensional  structure  of  NRIF3  is  responsible  for  its 
observed  specificity. 

Accumulating  evidence  suggests  that  the  actions  of  tran¬ 
scriptional  activating  proteins  are  (usually)  mediated  by  mul¬ 
tiprotein  complexes  (59),  and  such  a  scheme  is  also  likely  for 
nuclear  receptors.  For  example,  biochemical  evidence  suggests 
that  multiprotein  complexes  associate  with  liganded  TR  and 
VDR  (24, 62, 86).  Interestingly,  many  of  the  proteins  identified 
in  these  studies  are  not  known  coactivators.  While  the  study  of 
known  coactivators  such  as  CBP/p300  and  members  of  the 
SRC-1  family  has  suggested  that  histone  acetylation  may  play 
an  important  role  in  receptor-mediated  transactivation  (8, 14, 
57,  69),  detailed  elucidation  of  the  transactivation  mecha¬ 
nism^)  used  by  these  receptors  awaits  the  identification  and 
study  of  additional  cofactors  involved  in  the  transactivation 
process. 

Our  results  with  NRIF3  suggest  that  transcriptional  activa¬ 
tion  by  nuclear  receptors  may  employ  a  receptor-specific  co- 
activato^s)  in  addition  to  the  generally  used  coactivators  such 
as  CBP  and  SRC-1.  Therefore,  coactivators  with  NRIF3-like 
properties  may  contribute  to  the  functional  specificities  of  nu¬ 
clear  receptors  in  vivo.  Based  on  our  results  with  NRIF3  and 
the  results  of  previous  studies  of  nuclear  receptor  action,  we 
suggest  a  combinatorial  specificity  model  where  a  coactivation 
complex  is  likely  composed  of  two  kinds  of  factors:  general 
factors  that  interact  with  and  are  involved  in  the  action  of  many 
nuclear  receptors  (such  as  CBP  and  SRC-1)  and  specific  fac¬ 
tors  that  exhibit  receptor  specificity  (such  as  NRIF3).  In  addi¬ 
tion  to  interacting  with  the  liganded  receptor,  coactivators  may 
also  communicate  with  each  other  within  the  coactivation  com¬ 
plex  through  protein-protein  interactions  (e.g.,  CBP/p300  can 
interact  with  SRC-l/NCoA-1  or  p/CIP)  (37,  74, 84).  An  intrigu¬ 
ing  possibility  is  that  the  combinatorial  actions  of  specific  fac¬ 
tors  and  other  partners  involved  in  the  transactivation  process 
facilitate  the  recruitment  of  specific  coactivation  complexes  for 
different  receptors  (under  different  cell,  promoter,  and  HRE 
contexts),  which  would  provide  an  important  mechanistic  layer 
for  receptor  functional  specificity.  An  advantage  of  employing 
such  a  combinatorial  strategy  is  that  a  broad  array  of  diversity 
can  be  generated  from  a  relatively  small  number  of  involved 
factors.  Further  study  of  NRIF3  with  known  and  possibly  other 
yet  to  be  identified  coactivators,  as  well  as  analysis  of  the 
interplay  among  these  coactivators,  should  provide  important 
insights  into  the  molecular  mechanism(s)  underlying  the  spec¬ 
ificity  of  receptor-mediated  regulation  of  target  gene  expres¬ 
sion. 
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Nuclear  hormone  receptors  (NRs)  are  potential  targets  for  thera¬ 
peutic  approaches  to  many  clinical  conditions,  including  cancer, 
diabetes,  and  neurological  diseases.  The  crystal  structure  of  the 
ligand  binding  domain  of  agonist-bound  NRs  enables  the  design  of 
compounds  with  agonist  activity.  However,  with  the  exception  of 
the  human  estrogen  receptor-a,  the  lack  of  antagonist-bound 
"inactive"  receptor  structures  hinders  the  rational  design  of  re¬ 
ceptor  antagonists.  In  this  study,  we  present  a  strategy  for  design¬ 
ing  such  antagonists.  We  constructed  a  modei  of  the  inactive 
conformation  of  human  retinoic  acid  receptor-a  by  using  informa¬ 
tion  derived  from  antagonist-bound  estrogen  receptor-a  and  ap¬ 
plied  a  computer-based  virtual  screening  algorithm  to  identify 
retinoic  acid  receptor  antagonists.  Thus,  the  currently  available 
crystal  structures  of  NRs  may  be  used  for  the  rational  design  of 
antagonists,  which  could  lead  to  the  development  of  novel  drugs 
for  a  variety  of  diseases. 

Members  of  the  nuclear  hormone  receptor  (NR)  family  are 
under  the  control  of  a  wide  variety  of  hormones  and 
ligands,  such  as  steroids,  retinoids,  thyroid  hormone,  1,25- 
dihydroxy-vitamin  D3,  and  prostanoids.  Many  of  these  NRs  are 
potential  targets  for  the  therapy  of  a  variety  of  diseases:  antag¬ 
onists  of  estrogen  receptor-a  (ERa)  (e.g.,  tamoxifen)  are  clin¬ 
ically  used  for  the  treatment  of  breast  cancer  (1)  whereas  retinoic 
acid  receptor  (RAR)  agonists  and  antagonists  block  the  growth 
of  a  number  of  neoplastic  cells  including  breast  tumor  cells  (2, 
3).  Agonists  for  retinoid  X  receptors  (RXRs)  and  peroxisome 
proliferator-activated  receptor  y  (PPARy)  are  potential  candi¬ 
dates  for  use  in  the  treatment  of  cancer  and  diabetes  (PPARy  is 
the  receptor  for  the  antidiabetic  drug  thiazolidinedione)  (4-7), 
whereas  Nurrl  ligands  may  be  useful  for  treatment  of  Parkin¬ 
son's  disease  (8).  Thus,  designing  molecules  that  selectively 
activate  or  inhibit  specific  NRs  is  of  considerable  biological 
significance  and  will  likely  have  the  potential  for  use  in  important 
clinical  applications. 

The  crystal  structures  of  the  ligand  binding  domain  (LBD)  of 
many  members  of  the  NR  family  recently  have  been  solved,  and 
the  ligand-dependent  structural  changes  involved  in  transcrip¬ 
tional  activation  have  been  clarified,  enabling  the  structure- 
based  design  of  specific  agonists  (9,  10).  Recent  studies  on  ERa 
also  have  shed  light  on  the  LBD  structural  changes  mediated  by 
NR  antagonists  (11,  12):  ERa  agonists  and  antagonists  super¬ 
impose  well  and  engage  in  a  very  similar  network  of  hydrophobic 
and  electrostatic  contacts  with  the  receptor.  However,  in  the 
agonist-bound  conformation,  the  C-terminal  helix  H12  sits  like 
a  lid  on  top  of  the  ligand  (11)  (a  similar  observation  was  made 
for  virtually  all  of  the  NR  LBD  structures  solved  so  far;  ref.  9). 
In  contrast,  the  two  ERa  antagonists  present  a  protruding  arm 
that  is  not  compatible  with  the  “closed  lid”  conformation  (11, 
12)  (Fig.  L A).  As  a  result,  helix  H12  is  pushed  away  from  the 
ligand  binding  site  and  relocates  in  the  coactivator-binding 
pocket  of  the  receptor  (Fig.  IB)  (11).  Moreover,  the  LxxML 
motif  (where  L  is  a  leucine,  M  a  methionine,  and  x  any  residue) 
of  the  ERa  helix  H12  mimics,  and  probably  competes  with,  a 
LxxLL  helical  peptide  found  in  a  wide  variety  of  coactivator 
proteins.  The  alignment  of  the  LBD  of  various  NRs  (13)  suggests 


that  a  common  structural  mechanism  would  be  for  the  antago¬ 
nists  to  induce  the  relocation  of  helix  HI  2  into  the  hydrophobic 
coactivator-binding  groove  of  the  receptor.  The  observation  that 
the  progesterone  receptor  antagonist  RU486  superimposes  with 
the  natural  hormone  progesterone,  but  presents  a  protruding 
arm  similar  to  that  of  tamoxifen  (14, 15)  provides  support  for  the 
universality  of  this  mechanism  of  antagonistic  activity. 

Our  goal  in  this  study  is  to  provide  further  evidence  for  this 
hypothesis  by  building  a  model  of  the  antagonist-bound  confor¬ 
mation  of  RARa,  a  NR  that  plays  an  important  role  in  the 
differentiation  and  proliferation  of  a  wide  variety  of  cell  types 
and  for  which  only  the  agonist  bound  conformation  is  known 
(16-18),  and  to  rationally  and  rapidly  identify  new  antagonists 
for  this  receptor.  We  built  a  model  of  the  antagonist-bound 
structure  of  RAR,  based  on  the  ERa/ tamoxifen  complex  (12). 
The  model  was  used  for  the  virtual  screening  of  a  database  of 
^150,000  available  compounds,  and  antagonist  candidates  were 
tested  in  vitro.  Two  novel  antagonists  and  a  novel  agonist  were 
discovered.  The  ligands  were  specific  for  RAR,  confirming  the 
validity  of  our  model  and  the  potential  therapeutic  application 
of  our  strategy. 

Materials  and  Methods 

Building  of  the  Model  of  Antagonist-Bound  RAR.  A  helical  peptide 
PLIREMLENP  corresponding  to  helix  H12  of  RARy  was 
docked  into  the  putative  coactivator  binding  pocket  of  another 
RARy  molecule.  We  hypothesized  that  the  IxxML  motif  con¬ 
tacts  the  coactivator  binding  site  of  the  receptor,  and  an  auto¬ 
matic  docking  procedure  was  carried  out  toward  this  site,  with 
flexible  protein  and  peptide  side  chains,  according  to  a  biased 
probability  Monte  Carlo  energy  minimization  procedure  (19, 
20).  Two  critical  features  of  the  interaction  between  the  LBDs  of 
NRs  and  their  coactivators  were  used  to  carry  out  the  docking: 
(/)  The  “charge  clamp,”  initially  observed  in  the  complex 
between  SRC-1  and  peroxisome  proliferator-activated  receptor 
y  (21),  where  a  conserved  glutamate  (E414  in  RARy)  and  lysine 
(K246  in  RARy)  at  opposite  ends  of  the  hydrophobic  cavity  of 
the  receptor  contact  the  backbone  of  the  coactivator's  LxxLL 
box,  enabled  the  orientation  of  the  helical  peptide,  (ii)  The 
finding  that  the  leucines  of  the  LxxLL  motif  of  SRC-1  are  buried 
in  the  hydrophobic  cavity  of  the  receptor  determines  which  side 
of  the  helix  faces  the  receptor.  Here,  the  isoleucine,  methionine, 
and  leucine  of  the  IxxML  motif  were  buried  in  the  binding  site 
of  RARy.  Loose  distance  restraints  were  set  between  the  charge 
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Fig.  1.  Modeling  of  the  antagonist-bound  structure  of  RAR.  Agonist  (white) 
and  antagomst  (cyan)  superimpose  in  the  binding  pocket  of  ERa,  but  the 
antagonist  presents  an  additional  protruding  arm  that  pushes  helix  1 2  (H 1 2, 
green)  away  (A).  As  a  result,  HI 2  relocates  in  the  coactivator  binding  pocket 
of  the  receptor  (HI  2,  red)  (6).  Based  on  the  ERa  structure,  helix  HI 2  of  RARy 
(red)  was  docked  to  the  coactivator  binding  pocket  of  the  RARy-LBD  (critical 
hydrophobic  residues  are  displayed  in  magenta)  (Q,  and  the  C  terminus  of  the 
protein  was  remodeled  from  its  agonist-bound  conformation  (green)  to  its 
antagonist-bound  conformation  (red)  (D). 

clamp  of  the  receptor  (21)  (i.e.,  E414  and  K246)  and  backbone 
nitrogen  and  oxygens  of  the  peptide  (nitrogen  of  the  isoleucine 
on  one  end,  and  carbonyl  of  the  methionine,  leucine,  and 
asparagine  in  the  MLEN  motifs,  respectively).  The  energy  of  the 
complex  was  minimized  in  the  internal  coordinate  space  by  using 
the  modified  ECEPP/3  potentials.  The  subset  of  the  variables 
minimized  with  the  icm  method  (19, 20, 22, 23)  included  the  side 
chains  of  the  receptor,  six  positional  variables  of  the  helix,  and 
the  side-chain  torsion  angles  of  the  helix. 

After  the  icm  docking  procedure,  we  built  a  model  of  antag¬ 
onist-bound  RARy.  The  structure  of  the  receptor  was  kept  rigid 
but  for  the  side  chains  and  backbone  of  the  25  C-terminal 
residues  (corresponding  to  the  last  10  residues  of  helix  Hll,  the 
loop  from  Hll  to  H12,  and  HI 2),  and  for  the  side  chains  of  the 
putative  coactivator  binding  site  (within  6  A  of  the  previously 
docked  helical  peptide).  Tethers  then  were  set  between  the  C 
terminus  of  the  receptor  and  the  corresponding  residues  of  the 
docked  helical  peptide,  and  the  energy  of  the  receptor  was 
minimized  by  a  stochastic  global  energy  optimization  in  the 
internal  coordinate  space  (22,  23). 

The  last  step  was,  from  the  resulting  model  of  antagonist- 
bound  RARy,  to  derive  the  structure  of  the  antagonist-binding 
pocket  of  RARa:  the  three  nonidentical  residues  in  the  vicinity 
of  the  binding  pocket  (A234,  M272,  and  A397)  were  changed  to 
the  RARa  isoform  (S234,  1272,  and  V397,  respectively)  and 
energy-minimized.  Another  possibility  would  have  been  to  in¬ 
troduce  the  mutations  before  remodeling  the  C  terminus  of  the 
receptor.  We  preferred  to  proceed  as  described  here  to  preserve 
the  integrity  of  the  receptor  during  the  critical  remodeling  of  the 
C-terminal  end. 

Receptor-Ligand  Docking.  An  initial  docking  was  carried  out  with 
a  grid  potential  representation  of  the  receptor  and  flexible 
ligand  (24).  The  resulting  conformation  then  was  optimized  with 


a  full  atom  representation  of  the  receptor,  flexible  receptor  side 
chains,  and  flexible  ligand,  by  an  icm  stochastic  global  optimi¬ 
zation  algorithm  as  implemented  in  the  MolSoft  icm  2.7  program 
(23,  24). 

Screening  of  a  Virtual  Library  of  Compounds.  The  flexible-ligand/ 
grid-potential-receptor  docking  algorithm  (23,  24)  was  carried 
out  automatically  on  the  Available  Chemicals  Directory  library 
of  153,000  available  chemical  compounds  (MDL  Information 
Systems,  San  Leandro,  CA).  The  screening  took  less  than  a 
month  on  10  194-MHz  IP25  processors.  Each  compound  was 
assigned  a  score,  according  to  its  fit  with  the  receptor,  which  took 
into  account  continuum  as  well  as  discreet  electrostatics,  hydro- 
phobicity,  and  entropy  parameters  (25).  The  distribution  of  the 
compounds  according  to  their  score  is  presented  at  http:// 
abagyan.scripps.edu/PNAS/MS2000/.  All  compounds  scoring 
better  (i.e.,  lower)  than  -32  were  screened  further  for  the 
number  of  hydrogen  bonds  engaged  with  the  receptor.  The  134 
compounds  that  made  at  least  two  hydrogen  bonds  with  the 
receptor  were  preselected.  The  609  compounds  scoring  better 
than  -37  also  were  preselected,  regardless  of  the  hydrogen 
bonding  network.  This  preselection  pool  then  was  further  min¬ 
imized  with  a  full  atom  representation  of  the  receptor,  as 
described  above.  The  quality  of  the  fit  of  the  500  best-scoring 
compounds  then  was  visually  estimated,  and  32  compounds  were 
selected  for  biological  testing.  These  compounds  are  not  neces¬ 
sarily  the  ones  with  the  best  final  scores,  but  the  ones  we  thought, 
after  careful  visual  inspection,  presented  the  best  characteristics, 
such  as  Van  der  Waals  fit  or  hydrogen  bonding  (see  http:// 
abagyan.scripps.edu/PNAS/MS2000/). 

It  occurred  to  us  that  during  the  selection  by  the  MolSoft 
virtual  screening  procedure,  it  was  preferable  to  set  up  an  initial 
cut-off  value  poorly  selective  (i.e.,  -32)  to  recover  a  large  pool 
of  preselected  compounds  and  to  apply  to  this  pool  subsequent 
screens  specific  for  the  system,  such  as  number  of  hydrogen 
bonds  (used  here)  or  presence  of  a  hydrogen  bond  acceptor  (for 
example)  at  a  specific  point  of  space.  As  a  result,  we  derived  the 
value  -32  as  a  good  initial  threshold  (this  value  generates  an 
initial  pool  of  3,000-4,000  compounds). 

Biological  Activity  of  the  Antagonist  and  Agonist  Candidates.  HeLa 
cells  were  transfected  by  calcium  phosphate  precipitation  using 
1  /xg  of  the  Gal4-responsive  chloramphenicol  acetyltransferase 
(CAT)  reporter  pMCl  10  and  1  /xg  of  Gal4-hRARa-LBD  or  1  /xg 
of  Gal4-hRXR/3-LBD.  Studies  also  were  performed  with  the 
three  wild-type  hRAR  isoforms  (hRARa,  hRAR/3,  and 
hRARy)  by  using  a  AMTV-IR-CAT  reporter  as  described  (26, 

27).  Cell  cultures  were  supplemented  with  indicated  ligands  gj  t 
immediately  after  addition  of  the  calcium  phosphate/DNA  §>  \ 
precipitate.  Media  and  ligands  were  replaced  after  24  h,  and  cells  §  \ 
were  harvested  and  essayed  for  CAT  activity  24  h  later.  w  v 

Results 

Modeling  of  the  RAR  Antagonist  Binding  Pocket.  The  x-ray  structure  * 
of  RARy  bound  to  the  agonist  all-trans  RA  is  available  (18);  ( 

however,  the  conformation  of  the  receptor  bound  to  an  antag-  J 
onist  is  not  known.  We  used  the  observations  made  from  the  \ 
structure  of  ERa  bound  to  an  agonist,  17/3-estradiol  (11),  and  \ 
two  antagonists,  tamoxifen  and  raloxifene  (11,  12),  to  build  a  | 
model  of  antagonist-bound  RAR  (Fig.  1  A  and  B).  We  docked  t 
helix  H12  of  RARy  into  the  putative  coactivator  binding  pocket 
of  the  receptor  as  described  (27)  (see  Materials  and  Methods  for 
details)  (Fig.  1C)  and  remodeled  the  25  C-terminal  residues, 
starting  near  the  end  of  helix  11,  through  an  extensive  global 
energy  minimization  procedure  (Fig.  ID). 

Docking  of  Known  RAR  Antagonists  into  the  Modeled  Receptor.  A  few 

RAR  antagonists  have  been  described  in  the  literature;  and 
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Fig.  2.  RAR  antagonists.  Two  known  antagonists  (A  and  B)  and  two  novel 
antagonists  (C  and  D).  {Left)  Chemical  structure.  {Right)  Conformation  docked 
into  the  receptor  (part  of  the  receptor  is  displayed  as  a  ribbon  representation, 
and  the  binding  pocket  boundary  is  displayed  in  yellow).  Cyan,  carbons;  red, 
oxygen;  blue,  nitrogen;  magenta,  fluorine;  yellow,  sulfur.  Hydrogens  are  not 
represented  for  clarity. 


several  of  them  are  serious  candidates  for  cancer  therapy  (2.  28). 
A  well-characterized  ligand  is  AGN1 93109,  which  inhibits  the 
three  RAR  isoforms  at  nanomolar  concentrations  (29).  Another 
very  potent  antagonist  is  MX781,  which  is  effective  against 
ERa-positive  and  -negative  breast  cancer  cells,  with  no  apparent 
toxicity  (2).  The  activity  of  these  two  ligands  has  been  presented 
in  detail,  but  no  structural  information  has  been  reported  on 
their  mode  of  interaction  with  the  receptor.  We  built  a  model  of 
RARy  complexed  either  with  AGN193109  or  MX781,  by  using 
our  flexible  docking  algorithm  (24)  (Fig.  2  A  and  B).  In  both 
cases,  the  antagonist  superimposed  with  the  agonist  all-trans 
RA.  As  observed  for  ERa,  the  antagonists  also  presented  a 
protruding  arm,  which  was  absent  in  RAR  agonists.  Very 
importantly,  this  protruding  arm  coincided  exactly  with  the 
single  opening  in  the  ligand  binding  pocket  of  our  modeled 
receptor,  generated  by  the  displacement  of  helix  H12  (Fig.  2  A 
and  B),  and  made  stabilizing  hydrophobic  contacts  with  the 
protein.  It  is  very  unlikely  that  this  perfect  fit,  observed  for  both 
antagonists,  was  fortuitous.  On  the  contrary,  this  feature  mimics 
the  inactivation  mechanism  revealed  by  the  crystal  structure  of 
ERa  bound  to  tamoxifen  and  raloxifene.  Therefore,  our  docking 
results  of  AGN1 93109  and  MX781  very  strongly  suggest  that:  (i) 
the  structural  mechanisms  of  antagonist  activity  for  ERa  are 
shared  by  other  NRs,  and  (ii)  our  model  of  the  RAR  antagonist 
binding  pocket  could  be  used  to  design  novel  antagonists. 

Screening  of  a  Virtual  Library  and  Discovery  of  Novel  RAR  Antagonists. 

High  throughput  functional  screening  currently  is  the  most  used 
method  for  the  discovery  of  receptor-specific  ligands.  Although 
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Fig.  3.  Functional  assays  of  the  novel  antagonists.  HeLa  cells  were  trans¬ 
fected  with  a  Gal4-hRARa-LBD  expression  vector  and  a  Gal4-CAT  reporter 
gene  (results  were  similar  in  studies  using  the  three  hRAR  isoforms).  The  cells 
were  incubated  with  5  nM  all-trans  RA  to  stimulate  CAT  activity,  and  the  effect 
of  each  antagonist  on  inhibiting  CAT  was  examined  at  2  and  20  p.M  concen¬ 
tration  (the  known  antagonist  RO-41-5253  was  used  as  a  positive  control). 


efficient,  it  requires  the  physical  availability  and  management  of 
hundreds  of  thousands  of  chemical  compounds.  In  the  present 
work,  we  used  a  virtual  library  composed  of  the  predicted 
structure  of  more  than  150,000  available  compounds  (see  Ma¬ 
terials  and  Methods).  Each  compound  was  automatically  docked 
in  a  grid  representation  of  the  modeled  RARa  antagonist 
binding  pocket.  Five  grid  potentials  carried  information  on  the 
shape,  hydrophobicity,  electrostatics,  and  hydrogen-bonding 
availability  of  the  receptor,  and  enabled  a  rapid  docking  simu¬ 
lation  (24,  25).  RARa  was  selected  over  the  other  two  isoforms 
(RAR/3  and  RARy)  because  recent  data  suggests  it  could  be  a 
medically  more  relevant  target  (28).  After  an  automatic  selection 
procedure  with  flexible  ligands,  and  optimization  of  the  selected 
candidates  with  flexible  protein  side  chains  (see  Materials  and 
Methods  for  details),  32  compounds  were  considered  as  potential 
antagonists  of  RARa  and  ordered. 

To  test  these  compounds  in  vitro ,  HeLa  cells  were  transfected 
with  a  Gal4-hRARa-LBD  expression  vector  and  a  Gal4-CAT 
reporter  gene  (26).  Studies  also  were  performed  with  the  three 
wild-type  hRAR  isoforms  and  a  AMTV-IR-CAT  reporter  (26, 
27).  These  gave  similar  results  as  those  found  with  Gal4- 
hRARa-LBD  (data  not  shown).  The  cells  were  incubated  with 
all-trans  RA  to  stimulate  CAT  activity,  and  the  effect  of  each 
antagonist  candidate  on  inhibiting  CAT  stimulation  by  all-trans 
RA  was  examined.  Possible  toxicity  of  the  compounds  was 
deduced  from  the  amount  of  cellular  protein  extract  after  2  days 
of  incubation.  Two  antagonist  candidates  inhibited  CAT  activity 
by  55%  and  33%  at  20  pM  with  no  apparent  toxicity  (Fig.  3).  The 
Gal4-hR  ARa  activity  illustrated  in  Fig.  3  was  equivalent  for  the 
other  two  RAR  isoforms  (data  not  shown).  No  inhibition  was 
observed  when  CAT  expression  was  under  the  control  of  a 
Gal4-mRXRj3-LBD  fusion  construct,  indicating  that:  (i)  the 
antagonists  are  specific  for  RAR,  and  (ii)  the  inhibition  is  caused 
by  an  interaction  with  the  Gal4-RAR-LBD  fusion  protein  and 
does  not  result  from  some  nonspecific  effect  on  CAT  activity 
(data  not  shown). 

The  two  RAR  antagonists  dock  into  the  ligand  binding  pocket 
of  the  receptor  (Figs.  2  C  and  D  and  4).  As  observed  for 
AGN1 93109  and  MX781,  they  fit  in  the  same  binding  pocket  as 
the  natural  agonist  all-trans  RA,  but  present  an  additional  arm, 
which  protrudes  out  of  the  pocket.  Antagonist  1  has  a  tri-f  luoro 
group  where  the  retinoid  receptor  ligands  usually  carry  a  car- 
boxylate  group  (in  antagonist  2,  the  corresponding  domain  is 
truncated).  In  our  model,  antagonist  2  engages  in  a  hydrogen 
bond  with  Ser-234  of  the  hRARa  (Fig.  4 B).  Flowever,  the  S234A 
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Fig.  4.  Novel  RAR  antagonists.  {A  and  B)  Stereo  representation  of  antago¬ 
nists  1  and  2  docked  into  the  binding  site  of  the  receptor.  The  ligands  make 
extensive  hydrophobic  interactions  with  residues  from  helix  3,  helix  5,  and 
helix  11.  Antagonist  2  ( B )  is  engaged  in  an  additional  hydrogen  bond  with 
Ser-234  of  helix  3  and  contacts  the  remodeled  C  terminus  (red)  at  Pro-405.  (C 
and  D)  The  fit  of  antagonists  1  and  2  into  the  receptor  binding  pocket  is  shown. 


mutation  in  the  other  two  isoforms  does  not  alter  the  ligand 
antagonist  activity,  suggesting  that  this  hydrogen  bond  is  not 
essential  for  the  interaction.  An  obvious  way  to  increase  the 
affinity  of  these  antagonists  would  be  to  substitute  the  tri-fluoro 
group  by  a  carboxylate  in  antagonist  1  or  elongate  and  add  a 
carboxylate  to  antagonist  2,  which  would  result  in  more  stabi¬ 
lizing  interactions  with  two  conserved  arginines  of  the  receptor. 
However,  the  purpose  of  this  work  is  to  provide  evidence  that  the 
rational  design  of  antagonists  from  the  model  of  the  inactive 
receptor  is  feasible  and  not  to  optimize  the  affinity  of  the 
compounds.  The  in  vitro  functional  assays  provide  evidence  that 
our  modeling  scheme  is  relevant  and  can  be  used  to  design  novel 
antagonists  of  NRs. 

We  applied  the  same  strategy  to  discover  agonists,  by  using  the 
crystal  structure  of  the  active  conformation  of  RARy  (18),  and 
could  discover  three  novel  agonists  10-25%  active  at  200  nM  and 
fully  active  at  20  ptM,  of  30  compounds  tested  (data  not  shown). 

Screening  of  a  Database  of  Known  Ligands.  To  assess  the  quality  of 
our  setup  of  the  icm  screening  algorithm  (23),  we  built  a  small 


virtual  database  made  up  of  antagonists  and  agonists  for  differ¬ 
ent  members  of  the  NR  family  (Table  1).  We  screened  this 
database  with  our  model  of  antagonist-bound  RAR,  as  we  did 
for  the  Available  Chemicals  Directory  database.  The  screening 
was  repeated  four  times,  to  test  the  reproducibility  of  our 
method.  Table  1  shows  that  for  each  ligand  the  score  varies  a  lot 
from  one  screening  to  the  other.  This  finding  reflects  the 
generation  of  different  ligand  conformations  from  one  docking 
simulation  to  another  (data  not  shown)  and  represents  the 
limitation  of  our  method,  as  discussed  below. 

Table  1  lists  as  “selected”  the  ligands  that  met  with  the  criteria 
for  preselection  and  final  inspection  during  the  Available  Chem¬ 
icals  Directory  screening  (i.e.,  score  better  than  -37  or  score 
better  than  -32  and  at  least  two  hydrogen  bonds  with  the 
receptor;  see  Materials  and  Methods  for  details).  Seven  of  the 
nine  known  RAR  ligands  (i.e.,  ^80%)  and  one  of  the  six 
non-RAR  ligands  (i.e.,  ^16%>)  were  selected.  The  fact  that  RAR 
agonists,  as  well  as  antagonists,  produced  good  scores  was 
expected,  because  the  binding  pocket  used  for  the  screening  is 
equivalent  to  the  agonist  binding  pocket,  with  an  additional 
opening  generated  by  the  remodeling  of  the  C  terminus  of  the 
receptor.  The  two  false  negatives,  AGN193836  and  Ro415253, 
were  missed  because  of  steric  clashes,  as  discussed  below. 
Antagonist  1  was  not  found  either,  reflecting  its  rather  low 
affinity  for  the  receptor.  It  is  important  to  underline  here  that 
we  do  not  expect  to  detect  all  of  the  true  binders.  The  algorithm 
was  rather  designed  to  minimize  the  number  of  false  positives, 
which  correlates  with  the  number  of  unnecessary  in  vitro 
experiments  (25). 

In  that  respect,  the  presence  of  one  false  positive  of  six 
nonbinders  could  be  alarming,  because  such  a  ratio  would 
represent  about  25,000  false  positives  of  a  database  of  150,000 
compounds.  However,  the  binding  pockets  of  the  NRs  repre¬ 
sented  in  this  database  are  close  in  size  and  shape;  as  a  result,  the 
database  used  for  this  benchmark  was  composed  of  molecules 
presenting  strong  similarities  with  RAR  ligands.  Therefore,  we 
believe  this  ratio  is  not  representative.  The  fact  that  we  needed 
to  test  only  32  molecules  to  discover  three  novel  RAR  ligands 
confirms  this  assumption. 

Next,  we  tried  to  address  why  some  ligands,  such  as  Ro415253, 
were  repeatedly  missed  by  our  screening  algorithm  (Ro415253 
was  still  not  selected  after  10  docking  simulations,  data  not 
shown).  We  hypothesized  that  the  ligand  could  not  fit  into  the 
potential  maps  generated  from  our  model  and  carried  out  a 
docking  simulation  with  a  full  atom  representation  of  the 
receptor,  according  to  a  Monte  Carlo  energy  minimization  of  the 
complex,  with  both  flexible  ligand  and  flexible  receptor  side 
chains  (24).  This  docking  simulation  produced  a  solution  were 
the  ligand  fits  into  the  binding  pocket;  the  core  of  the  ligand 
(from  the  carboxylate  to  the  internal  sulfone)  superimposes  with 
agonists  such  as  all-trans  RA,  whereas  the  alkyl  arm  sticks  out 
of  the  pocket,  as  previously  described  for  the  other  antagonists  j  ; 
(data  not  shown).  The  conformation  of  several  receptor  side  T 
chains  was  modified  during  the  docking  simulation,  to  accom¬ 
modate  the  size  of  the  ligand,  and  this  solution  would  not  have  1 
been  found  with  rigid  side  chains.  This  finding  suggests  that 
Ro415253  could  not  fit  into  the  potential  maps  generated  from 
the  original  receptor  conformation,  which  we  used  for  the 
screening.  We  generated  a  new  series  of  potential  maps  from  the 
optimized  receptor  structure  and  screened  the  small  database  of 
known  ligands  with  these  maps  four  times  as  above  (Table  1). 

The  score  assigned  to  Ro415253  was  twice  lower  (i.e.  better) 
than  the  threshold.  Surprisingly,  this  new  series  of  potential  maps 
totally  eliminated  the  presence  of  both  false  positive  and  false 
negative  (all  RAR  ligands  and  only  RAR  ligands  were  selected). 
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Table  1.  Control  screening  of  known  NR  ligands 


Ligand 

Activity 

Score  1 

Score  2 

Score  3 

Score  4 

Selected 

Binding 

References 

First  series 

AGN193836 

RAR_agonist 

-19.9 

-9.04 

-20.6 

-19.7 

+ 

(33) 

ATRA 

RAR  pan-agonist 

-46.4 

-41 

-41.7 

-41. 

+ 

+ 

(34) 

Ro41 5253 

RAR_antagonist 

-25.5 

-22. 

-28.3 

-28.6 

- 

+ 

(28) 

MX781 

RAR  antagonist 

-28. 

-23.9 

-27.1 

-36.4 

+ 

+ 

(2) 

CD2366 

RAR  pan-antagonist 

-28.5 

-23.3 

-30.9 

-32.3 

+ 

+ 

(34) 

Targ  retin 

RXR  pan-agonist 

-17.9 

-18.1 

-19.1 

-18.6 

- 

- 

(4) 

SR 11 203 

RXR  pan-agonist 

-27.5 

-27. 

-27. 

-27.2 

- 

- 

(34) 

Tamoxifen 

ER  modulator 

-29.3 

-27.5 

-29.8 

-28.3 

- 

- 

(23) 

Raloxifene 

ER  modulator 

-23.4 

-20.8 

-26.7 

-34.6 

+ 

- 

(22) 

RU486 

Progest  Rec  antag. 

-21.2 

-21.3 

-21.4 

-21.3 

- 

- 

(25) 

9cisRA 

RAR/RXR  agonist 

-32.5 

-32.6 

-32.9 

-16.9 

+ 

+ 

(34) 

AGN193109 

RAR  pan-antagonist 

-39.2 

-56. 

-57.4 

-39.4 

+ 

+ 

(29) 

AGNpartia 

RAR  partial  agonist 

-54.4 

-54.3 

-49.5 

-29.1 

+ 

+ 

(29) 

Am580 

RAR_agonist 

-34.2 

-34.4 

-34.8 

-34.5 

+ 

+ 

(34) 

EM652 

ER  antagonist 

-27. 

-27.4 

-21.7 

-28.8 

- 

- 

(35) 

Antagonist  1 

Novel  RAR  antag. 

-28.5 

-28.1 

-28.7 

-28.8 

- 

+ 

(35) 

Antagonist  2 

Novel  RAR  antag. 

-27.6 

-38.9 

-40.2 

-26.3 

+ 

+ 

(35) 

Second  series 
AGN193836 

RAR_agonist 

-37.2 

-36.5 

-36.7 

-35.3 

+ 

+ 

(33) 

ATRA 

RAR  pan-agonist 

-51.7 

-52.6 

-51.8 

-52.0 

+ 

+ 

(34) 

R0415253 

RAR„antagonist 

-28.9 

-24.4 

-39.0 

-46.6 

+ 

+ 

(28) 

MX781 

RAR  antagonist 

-45.3 

-48.0 

-40.2 

-45.6 

+ 

+ 

(2) 

CD2366 

RAR  pan-antagonist 

-50.7 

-50.8 

-29.3 

-29.3 

+ 

+ 

(34) 

Targretin 

RXR  pan-agonist 

-25.4 

-23.0 

-22.2 

-31.0 

- 

- 

(4) 

SR1 1203 

RXR  pan-agonist 

-28.2 

-22.7 

-22.1 

-27.5 

- 

- 

(34) 

Tamoxifen 

ER  modulator 

-26.4 

-24.6 

-30.3 

-23.4 

- 

- 

(23) 

Raloxifene 

ER  modulator 

-15.6 

-23.7 

-18.4 

-17.4 

- 

- 

(22) 

RU486 

Progest  Rec  antag. 

-21.4 

-20.6 

-20.3 

-20.1 

- 

- 

(25) 

9cisRA 

RAR/RXR  agonist 

-38.8 

-39.5 

-33.5 

-38.7 

+ 

+ 

(34) 

AGN193109 

RAR  pan-antagonist 

-55.1 

-55.5 

-41.2 

-54.8 

+ 

+ 

(29) 

AGNpartia 

RAR  partial  agonist 

-61.4 

-61.3 

-61.4 

-61.0 

+ 

+ 

(29) 

Am  580 

RAR„agonist 

-46.6 

-47.2 

-46.6 

-46.5 

+ 

+ 

(34) 

EM652 

ER  antagonist 

-26.3 

-23.1 

-23.7 

-27.3 

- 

- 

(35) 

Antagonist  1 

Novel  RAR  antag. 

-32.1 

-32.1 

-31.7 

-31.6 

+ 

+ 

(35) 

Antagonist  2 

Novel  RAR  antag. 

-33.3 

-29.7 

-33.8 

-33.8 

+ 

+ 

(35) 

First  series:  A  similar  screening  as  the  one  performed  on  the  ACD  database  was  carried  out  four  times  on  a  small  database  made  of  known  RAR  antagonists, 
agonists,  as  well  as  ligands  for  other  NRs  and  the  two  novel  RAR  antagonists.  The  ligands  that  met  at  least  once  with  the  criteria  for  selection  used  during  the 
ACD  screening  are  listed  as  Selected.  The  ligands  that  are  experimentally  binding  to  RAR  are  listed  as  Binding.  Second  series:  Screening  of  known  ligands  after 
adjustment  of  the  receptor's  binding  pocket  conformation.  The  RAR  antagonist  Ro41 5253  was  docked  into  our  model  of  antagonist-bound  RAR  with  flexible 
receptor  side  chains  and  ligand.  The  resulting  receptor  conformation  was  used  for  a  novel  screening. 


Discussion 

In  this  study,  we  presented  a  strategy  for  the  discovery  of 
antagonists,  as  well  as  agonists,  for  NRs,  which  are  very  impor¬ 
tant  targets  for  drug  design.  An  important  aspect  of  our  ap¬ 
proach  was  to  exclude  any  preconceived  pharmacophore  bias 
from  our  database  screening.  Most  drug  design  strategies  impose 
chemical  constraints  on  the  selected  molecule  to  conserve  the 
functional  groups  believed  to  be  most  important  in  existing 
ligands,  preventing  the  discovery  of  novel  ligand  types.  In  the 
present  work,  we  avoided  pharmacophore  constraints  thanks  to 
a  robust  flexible  docking  program  and  scoring  function:  the  only 
filters  used  for  screening  were  a  good  fit  with  the  receptor  and 
reasonable  bioavailability  parameters  (30).  As  a  result,  we 
discovered  novel  original  ligands  that  could  be  further  optimized 
into  potent  RAR-selective  antagonists  and  agonists. 

A  limitation  of  our  method,  which  leaves  room  for  further 
improvement,  is  that  a  compromise  must  be  made  between  the 
time  allocated  for  each  ligand  (less  than  2  min  on  one  processor 
here)  and  the  reliability  of  the  sampling  of  the  conformational 
space.  Indeed,  Table  1  shows  that  four  runs  for  each  ligand  are 
necessary  to  minimize  efficiently  missed  hits  (the  remaining 


missed  positives  were  not  selected  because  of  inappropriate 
receptor  side-chain  conformations  and  not  because  of  an  insuf¬ 
ficient  sampling).  Improvement  of  the  computing  power,  the 
docking  algorithm,  and  the  scoring  function  all  could  result  in  a 
more  robust  virtual  database  screening. 

Another  drawback  is  that  the  conformation  of  the  receptor  is 
not  necessarily  unique,  but  can  vary  from  one  ligand  to  another. 
As  a  result,  a  ligand  that  fits  in  receptor  conformation  A  will 
never  be  found  if  receptor  conformation  B  is  used  for  the 
screening.  The  case  of  Ro415253  illustrates  this  issue  well:  this 
known  antagonist  was  never  selected,  even  after  10  trials, 
because  the  binding  pocket  used  for  the  screening  was  too 
narrow.  The  potential  maps  used  for  the  screening  have  a 
smoother  van  der  Waals  profile  than  the  atomic  representation 
of  the  receptor;  as  a  result,  the  maps  are  more  tolerant  regarding 
steric  clashes  with  the  ligand.  However,  the  degree  of  tolerance 
is  limited  and  cannot  accommodate  important  conformational 
changes  of  the  receptor  side  chains  (or  backbone,  obviously). 
When  new  potential  maps  generated  from  a  model  of  RAR 
bound  to  Ro415253  were  used  for  screening,  the  three  RAR 
ligands  missing  from  the  first  screening  were  selected  (Table  1). 
This  finding  confirms  that  the  initial  conformation  of  the 
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receptor  prevented  the  selection  of,  or  reduced  the  chances  of 
selecting,  some  known  RAR  ligands.  The  false  positive  ralox¬ 
ifene  (Table  1)  was  making  extensive  van  der  Waals  interactions 
with  the  narrow  RAR  binding  pocket,  which  compensated  for 
the  lack  of  stabilizing  electrostatic  interactions.  However,  in  the 
new  conformation  of  the  receptor  (Table  1),  the  binding  pocket 
is  wider  and  the  fit  not  as  tight.  As  a  result,  raloxifene  was  not 
selected.  This  observation  emphasizes,  if  necessary,  that  virtual 
screening  is  very  sensitive  to  the  conformation  of  the  receptor. 

In  that  respect,  it  is  interesting  to  note  that  the  topology  of  the 
remodeled  C-terminal  loop  is  probably  not  unique,  and  that  the 
conformation  used  to  generate  the  receptor  potential  maps  was 
one  among  many  others.  It  is  therefore  legitimate  to  wonder 
whether  novel  antagonists  could  not  be  discovered  as  efficiently 
from  a  structure  of  the  receptor  where  the  C  terminus,  instead 
of  being  remodeled,  was  truncated.  This  brings  up  a  fundamental 
question:  is  the  role  of  antagonists  only  to  antagonize  the  “closed 
lid”  conformation  where  helix  H12  sits  on  top  of  the  ligand 
binding  pocket,  or  are  they  also  stabilizing  the  inactive  confor¬ 
mation  of  the  receptor?  It  is  important  to  keep  it  mind  that  the 
C-terminal  tail  of  RAR  (as  well  as  for  other  NRs)  is  a  very 
dynamic  entity  when  no  ligand  is  bound  to  the  receptor  and 
probably  oscillates  between  active  and  inactive  conformations. 
Once  bound  in  the  ligand  binding  pocket,  agonists  contact  the 
H12  helix  and  lock  the  receptor  in  its  coactivator-binding 
conformation.  Likewise,  it  is  reasonable  to  speculate  that  an¬ 
tagonists  would  contact  the  C-terminal  tail  of  the  receptor  and 
stabilize  the  inactive  state.  However,  it  is  probable  that  the 
conformation  of  the  receptor  varies  from  one  ligand  to  another; 
indeed,  recent  results  on  ERa  show  that  different  ligands  induce 
distinct  conformational  change  of  the  receptor  (31).  We  used  the 
crystal  structure  of  ERa  bound  to  tamoxifen  to  build  our  model 
of  inactive  RAR  and  could  find  two  specific  antagonists,  one  of 
which  contacts  the  remodeled  tail  of  the  receptor.  Although  the 
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conformation  we  used  for  the  C-terminal  tail  was  probably  not 
the  only  possible  one,  we  believe  that  its  presence  was  important 
to  bias  the  screening  toward  compounds  that  actually  do  contact 
the  flexible  arm  of  RAR,  as  well  as  to  impose  a  reasonable 
boundary  on  the  antagonist  binding  pocket,  and  prevent  the 
ligands  from  drifting  out  of  the  pocket  during  the  docking 
simulations. 

An  important  point  was  to  demonstrate  that  we  could  discover 
novel  antagonists  for  a  NR  other  than  ERa,  provided  that  the 
structure  of  the  agonist-bound  active  form  of  the  protein  was 
known.  Rational  design  of  ligands  from  a  model  of  a  receptor  is 
thought  by  many  to  yield  very  low  success  rates.  The  present 
study  demonstrates  that  this  strategy  can  be  successfully  under¬ 
taken  with  appropriate  biological  systems  and  robust  modeling 
tools.  Moreover,  targeting  models  of  diverse  members  of  the  NR 
family  could  be  further  justified  by  the  wealth  of  structural  and 
sequence  information  (9,  13),  as  well  as  the  finding  that  NR 
family  members  share  similar  mechanisms  of  transcriptional 
activation  and  inhibition  (9). 

The  recent  publication  of  the  crystal  structures  of  medically 
relevant  receptor  targets,  such  as  peroxisome  proliferator-activated 
receptor  y  (21),  RAR  (18),  RXR  (32),  ERa  (11),  or  progesterone 
receptor  (15),  has  created  an  exciting  opportunity  for  the  discovery 
of  novel  ligands.  This  study  demonstrates  that  the  rational  design  of 
both  antagonists  and  agonists,  by  using  computer-generated  models 
based  on  these  structures,  is  possible. 
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Abstract 


Several  RAR  agonists  have  therapeutic  activity  against  a  variety  of  cancer  types;  however, 
unacceptable  toxicity  profiles  have  hindered  the  development  of  drugs.  RAR  agonists  presenting 
novel  structural  and  chemical  features  could  therefore  open  new  avenues  for  the  discovery  of 
leads  against  breast,  lung  and  prostate  cancer  or  leukemia.  We  have  applied  a  high  throughput 
flexible  internal  coordinate  docking  approach  (ICM)  to  discover  such  compounds.  A  library  of 
over  150.000  molecules  was  docked  in  silico  to  the  structure  of  the  receptor  and  30  ligand 
candidates  were  tested  in  vitro.  Three  novel  agonists  were  identified,  one  of  them,  agonist  3, 
presenting  two  original  features:  (i)  a  penta-methylated  benzene  linked  by  a  ketone  to  a  second 
aromatic  ring,  forming  a  very  large  hydrophobic  head,  (ii)  a  pyridine  replacing  a  carboxylate  or 
ester  present  in  all  RAR  ligands  described  so  far.  These  novel  structural  features  may  translate 
into  improved  toxicity  profiles  and  result  in  the  development  of  new  ligands  for  cancer  therapy. 


Introduction 


The  retinoic  acid  receptors  (RAR-a,  -|3,  and  -y)  are  transcription  factors  regulating  a  variety  of 
endocrine  metabolic  pathways.  Unlike  anti-estrogens,  such  as  tamoxifen  or  raloxifene,  ligands 
targeted  against  the  RAR  isoforms  can  present  anticancer  activity  against  both  estrogen  receptor 
positive  and  negative  breast  tumor  cells1.  As  a  result,  such  molecules  could  constitute  a  novel 
generation  of  drugs  against  breast  cancer.  For  reasons  not  yet  clear,  both  agonists  and  antagonists 
of  RAR  can  present  anti-tumor  activity  against  breast,  prostate,  lung  cancer  or  leukemia1'7.  The 
development  of  both  types  of  ligands  could  therefore  have  important  biomedical  implications. 
We  have  recently  demonstrated  that  antagonists  could  be  discovered  rationally,  based  on  a  model 
of  the  antagonist-bound  conformation  of  the  receptor8.  Our  goal  here  is  to  discover  innovative 
molecular  structures  with  RAR  agonist  activity. 

Several  retinoid  and  non-retinoid  ligands  have  been  described,  which  activate  one  or  a 
combination  of  RAR  isoforms.  Some  of  them,  such  as  the  natural  ligand  all-trans  retinoic  acid 
(all-trans  RA),  have  been  tested  clinically,  and  display  unacceptable  side  effects,  such  as  skin 
dryness,  cheilitis,  hypertriglycemia  and  conjunctivitis9'10.  However,  the  compounds  tested  so  far 
belong  to  limited  series  of  related  structures.  An  increasing  amount  of  data  suggest  that  the 
RARa  isoform,  which  controls  the  expression  of  RARp,  is  the  most  relevant  target  for 
anticancer  therapy1115.  Innovative  molecules  with  RARa  agonist  activity  could  therefore  present 
more  favorable  toxicity  profile. 

We  applied  a  flexible  virtual  screening  algorithm  (Molsoft  ICM,  virtual  screening  module16) 
which  rapidly  docks  hundreds  of  thousands  of  flexible  compound  structures  into  the  ligand 
binding  pocket  of  RARa  and  discovered  three  novel  RAR  pan-agonists.  At  least  one  ligand  has 
unexpected  structural  and  chemical  characteristics,  which  could  be  used  in  the  development  of 
novel  compounds  for  cancer  prevention  and  therapy. 


Results  and  discussion 

We  first  built  a  model  of  the  RARa  agonist  binding  pocket  from  the  crystal  structure  of  the 


RARy  ligand  binding  domain  (RARy  LBD)  /  all-trans  RA  complex17.  All  but  three  amino  acids 
in  the  vicinity  of  the  ligand  are  conserved  between  the  two  isoforms.  These  three  non-identical 
residues  -A234,  M272,  and  A397-  were  changed  to  the  RARa  isoform  -S234, 1272  and  V397- 
and  energy  minimized  (see  "Experimental  Section"). 

In  order  to  address  the  accuracy  of  our  model  of  the  RARa  binding  pocket,  we  docked  AM580, 
an  RARa  specific  agonist18,  into  the  receptor.  A  rapid  docking  procedure  with  flexible  ligand 
and  a  grid  representation  of  the  receptor  was  followed  by  an  extensive  Monte  Carlo  energy 
minimization  with  both  ligand  and  receptor  side  chains  flexible  (see  "Experimental  Section"  for 
details).  The  ligand  superimposed  well  with  the  natural  ligand  all-trans  RA  (Figure  la). 
Interestingly,  Am580  does  not  seem  to  fit  in  the  receptor  binding  site:  the  ketone  oxygen  of  the 
ligand  sticks  out  of  the  binding  pocket,  due  to  too  close  proximity  of  residue  234  (Figure  lb). 
However,  in  the  complex  with  RARa,  this  ketone  oxygen  actually  shares  an  hydrogen  atom  with 
the  hydroxyl  group  of  serine  234,  and  forms  a  stabilizing  hydrogen  bond,  while  a  steric  clash 
occurs  in  the  other  two  RAR  isoforms,  where  residue  234  is  an  alanine.  Consequently,  our  model 
provides  a  rational  for  Am580  isoform  specificity,  suggesting  that  it  is  relevant  and  could  be 
used  as  a  template  for  the  discovery  of  novel  RARa  agonist  structures. 

A  high  throughput  virtual  screening  was  carried  out  on  the  Available  Chemicals  Directory  (MDL 
Information  Systems,  San  Leandro,  CA),  a  compound  structure  database  of  over  150,000 
molecules.  Each  compound  was  automatically  docked  into  a  grid  representation  of  RARa,  as 
previously  described  for  Am580,  and  assigned  a  score  according  to  the  quality  of  the  fit19,16.  The 
5364  ligand  candidates  which  scored  better  (i.e.  lower)  than  -32  kcal/mol  were  pre-selected  for  a 
more  refined  energy  minimization  procedure,  with  flexible  receptor  side  chains  (see 
"Experimental  Section"  for  details).  After  careful  visual  examination,  30  molecules  were  selected 
and  purchased  to  be  experimentally  tested  in  vitro. 

HeLa  cells  were  transfected  with  a  Gal4-hRARa-LBD  expression  vector  and  a  Gal4-CAT 
reporter  gene20.  The  cells  were  incubated  with  each  ligand  at  0.1,  1  and  10  pM  to  stimulate  CAT 
activity.  Possible  toxicity  of  the  compounds  was  deduced  from  the  amount  of  cellular  protein 
extract  after  2  days  of  incubation.  The  percentage  of  conversion  induced  by  all-trans  RA  at  100 


nM  was  used  as  a  positive  control  for  maximum  induction.  Two  agonist  candidates  (agonist  1 
and  agonist  2)  induced  92%  and  98%  of  the  maximum  CAT  activity  at  10  pM,  and  23%  and 
88%  of  maximum  CAT  activity  at  1  pM  respectively,  with  no  apparent  toxicity  (Figure  2). 
Agonist  2,  in  particular,  was  a  rather  good  agonist:  it  still  could  induce  44%  of  maximum  CAT 
activity  at  200  nM.  The  Gal4-hRARa-LBD  activity  was  similar  to  the  other  two  RAR  isoforms, 
and  studies  performed  with  the  three  wild  type  hRAR  isoforms  and  a  AMTV-ER-CAT20’21 
reporter  confirmed  the  results  (data  not  shown).  The  compounds  could  also  activate  a  Gal4- 
mRXR(3-LBD  fusion  construct  at  10  pM:  60%  and  20%  of  maximal  CAT  activity  was  induced 
by  Agonist  1  and  2  respectively  (data  not  shown). 

Agonist  1  and  2  both  have  a  carboxylate  group  which  superimposes  with  the  carboxylate  of  all- 
trans  RA,  and  make  stabilizing  hydrogen  bonds  with  Arg  274  and  278,  and  the  backbone 
nitrogen  of  Ser  289  (Figure  3).  An  additional  electrostatic  interaction  is  probably  present 
between  Ser  234  and  the  thiazole  nitrogen  of  agonist  1.  However,  it  is  not  necessary  for  the 
activity  of  the  compound  since  residue  234  is  an  alanine  in  RARP  and  RARy,  and  no  difference 
of  activity  is  observed  between  the  three  isoforms.  All  other  interactions  between  the  receptor 
and  RARa  are  hydrophobic.  As  a  result,  the  size  of  the  ligand  and  the  shape  complementarity 
with  the  receptor  are  critical  for  affinity  and  specificity.  In  that  respect,  it  is  interesting  to  note 
that,  while  agonist  1  is  close  in  size  to  all-trans  RA,  agonist  2  is  substantially  larger,  thanks  to  its 
bulky  head  composed  of  two  tertio-butyl  groups.  This  feature  probably  underlies  the  observation 
that  agonist  2  is  more  active  than  agonist  1  (Figure  2). 

We  recently  described  a  similar  virtual  screening  procedure  carried  out  on  a  model  of  antagonist- 
bound  RAR,  where  the  C-terminal  helix  H12  was  translocated  onto  the  coactivator  binding  site 
of  the  receptor,  in  an  effort  to  discover  novel  RAR  antagonists8.  One  of  the  antagonist  candidates 
could  fit  into  our  model  of  active  RARa,  and  presented  substantial  agonist  activity:  it  induces 
28%  of  maximum  CAT  activity  at  1  pM  and  73%  at  10  pM  (Figure  2,  agonist  3).  This 
compound  is  much  larger  than  all-trans  RA  and  agonists  1  and  2  (Figure  3).  Agonist  3  did  not 
activate  RXR  at  concentrations  tested  (up  to  20  pM)  (data  not  shown). 


To  our  knowledge,  virtually  all  RAR  agonists  described  in  the  literature  have  three  characteristic 


structural  features:  (i)  A  bulky  hydrophobic  head,  such  as  the  poly-methylated  ring  of  all-trans 
RA  or  the  tetra-methyl  tetra-hydro  naphthalene  of  AM58018.  (ii)  An  elongated,  mostly 
hydrophobic  linker  about  10  A  long  (10.8  A  for  all-trans  RA).  (iii)  A  carboxylate  or  ester  end, 
which  makes  specific  electrostatic  contacts  with  Arg  274  and  278,  and  Ser  289  (Figure  3A  and 
3B).  Figure  3  shows  that  both  agonist  1  and  agonist  2  present  these  characteristics.  Agonist  2  is 
actually  very  close  to  Ch55,  an  already  known  RAR  agonist22.  However,  it  is  worth  mentioning 
that  the  tri-fluoro  group  at  the  head,  and  the  thiazole  ring  in  the  core  of  agonist  1  constitute 
original  chemical  structures,  for  RAR  ligands. 

Unlike  agonist  1  and  agonist  2,  the  hydrophobic  head  of  agonist  3  is  surprisingly  composed  of 
two  aromatic  moieties,  linked  by  a  ketone  group  (Figure  3C).  A  bipartite  head  was  already 
observed  in  compounds  with  an  adamantane  moiety,  such  as  the  RARy  selective  agonist 
CD43723.  However,  the  penta-methylated  ring  in  agonist  3  and  the  ketone  linking  the  two 
aromatic  moieties  constitute  to  our  knowledge  an  entirely  novel  structural  feature,  which  fits 
tightly  into  the  RAR  binding  pocket,  and  makes  extensive  Van  der  Waals  and  hydrophobic 
contacts  with  the  receptor.  Another  unexpected  feature  of  agonist  3  is  that  its  linker  region  is 
hydrophilic  which  is  unfavorable  to  the  stability  of  the  complex  (most  of  the  polar  groups  of  the 
linker  are  surrounded  by  hydrophobic  residues),  and  decreases  the  affinity  of  the  ligand  for  the 
receptor.  Consequently,  some  chemical  modifications  of  agonist  3,  such  as  replacing  the 
nitrogens  of  the  linker  by  carbons,  could  result  in  a  molecule  with  improved  affinity  for  the 
receptor. 

The  third  and  most  surprising  characteristic  of  agonist  3  is  that,  to  our  knowledge,  it  is  the  first 
RAR  ligand  described  so  far  which  lacks  a  carboxylate  group  (some  heteroarotinoids  have  been 
described  with  a  methyl  or  ethyl  ester  at  the  carboxylate  position,  but  this  group  might  be 
processed  in  vivo  into  the  carboxylate24,25.  Moreover,  docking  experiments  suggest  that  the 
nature  of  the  interaction  with  arginine  278  of  the  receptor  is  conserved  (data  not  shown)).  A 
pyridine  in  agonist  3  is  substituted  to  the  carboxylate,  and  the  nitrogen  of  the  pyridine  could  be 
engaged  in  a  pyridinium  thiolate  salt  bridge  with  the  side  chain  of  Cys237  (the  distance  between 
the  nitrogen  and  the  sulfur  is  only  3.5  A  in  our  model).  This  interaction  is  comparable  in  strength 
to  the  carboxylate/arginines  interaction  of  all  other  RAR  ligands.  Arg278  may  also  interact  with 


the  pyridine,  according  to  our  docking  simulation  (Figure  3  C).  The  development  of  molecules 
which  conserve  this  original  pyridine,  but  with  improved  affinity  for  the  receptor  could  represent 
a  new  generation  of  modulators  of  RAR  activity.  The  energy  of  complexation  could  be  improved 
for  instance  by  reducing  the  flexibility  and  increasing  the  hydrophobicity  of  the  linker  region, 
which  bridges  the  pyridine  to  the  penta-methyl  diphenyl  ketone  moiety.  According  to  our  model, 
the  nitrogens  of  the  linker  are  not  making  stabilizing  interactions  with  the  receptor  and  could  be 
replaced  by  carbons,  while  one  of  the  two  carbonyl  oxygens  of  this  domain  could  be  engaged  in 
a  hydrogen  bond  with  Ser234  (however,  this  interaction  is  probably  not  critical,  since  agonist  3 
activates  also  RAR(3  and  RARy,  which  have  an  alanine  at  position  234).  The  structure  of  agonist 
3  and  the  model  of  its  interaction  with  RARa  detailed  here  represent  an  avenue  towards  new 
RAR  agonist  ligands. 


Conclusion 

This  report  details  the  rapid  discovery  of  RAR  agonists  with  novel  structural  features,  thanks  to  a 
powerful  high  throughput  virtual  screening  approach,  and  a  research  strategy  where 
considerations  on  existing  ligands  are  avoided.  One  of  the  molecules  presented  here  constitute  a 
good  framework  for  the  development  of  a  novel  series  of  RAR  ligands,  deprived  of  carboxylate 
and  very  different  from  all  structures  described  so  far.  Such  ligands  could  present  more 
favorable  specificity  and  toxicity  profiles,  and  have  important  applications  in  cancer  therapy. 
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Experimental  Section 


Modeling  of  RARa  ligand  binding  pocket 

The  crystal  structure  of  RARy-LBD  complexed  to  all-trans  RA  was  used  as  a  template  and  the 
three  residues  in  the  vicinity  of  the  ligand  which  are  not  conserved  between  the  two  isoforms 
were  mutated  accordingly:  A234,  M272,  and  A397  were  changed  to  S234, 1272  and  V397 
respectively.  The  rotation  variables  of  the  side  chains  within  3.5  A  of  the  mutated  residues  were 
unfixed  and  the  energy  of  the  system  was  minimized  in  the  internal  coordinate  space,  according 
to  the  ICM  method26. 

Docking  of  AM580  into  RARa 

The  flexible  ligand  was  docked  into  a  combination  of  five  potential  map  representations  of 
RARa  ligand  binding  pocket,  which  account  for  hydrogen  bonding,  hydrophobicity, 
electrostatics  and  entropy  parameters27.  This  rapid  docking  procedure  was  followed  by  a  more 
refined  energy  minimization  of  the  complex,  with  a  full  atom  representation  of  the  receptor,  and 
flexible  receptor  side  chains,  by  an  ICM  stochastic  global  optimization  algorithm  ’  as 
implemented  in  the  Molsoft  ICM2.7  program16. 

Virtual  screening  of  the  compound  structure  database 

The  procedure  followed  was  the  same  as  previously  described8:  each  flexible  ligand  of  the 
Available  Chemicals  Directory  (MDL  Information  Systems,  San  Leandro)  was  docked 
automatically  into  the  combination  of  potential  maps  described  above,  and  assigned  a  score 
according  to  its  fit  with  the  receptor.  The  scoring  function  included  continuum  as  well  as  discreet 
electrostatics,  hydrophobicity  and  entropy  parameters19.  The  screening  of  the  over  150,000 
ligands  database  took  less  than  a  month  on  10  "194  MHZ  IP25"  processors.  The  5364 
compounds  which  scored  better  (i.e.  lower)  than  -32  were  pre-selected  for  a  second  round  of 
selection,  and  were  automatically  docked  into  a  full  atom  representation  of  the  receptor,  with 
flexible  receptor  side  chains,  according  to  a  global  energy  optimization  in  internal 
coordinates27,16.  The  binding  energy  of  the  compounds  was  then  evaluated30  and  the  300 
compounds  showing  the  lowest  binding  energy  were  selected  for  further  examination.  After 


careful  visual  inspection  for  shape  complementarity  and  hydrogen  bonding  network,  30 
molecules  were  selected  and  purchased  to  be  experimentally  tested  in  vitro. 

Biological  activity  of  the  ligand  candidates 

HeLa  cells  were  transfected  by  calcium  phosphate  precipitation  using  1  pg  of  the  Gal4- 
responsive  CAT  reporter  pMCl  10  and  1  pg  of  Gal4-hRARa-LBD  or  1  pg  of  Gal4- 
mRXR(3-LBD.  Studies  were  also  performed  with  the  three  wild  type  hRAR  isoforms  using  a 
AMTV-IR-CAT  reporter  as  previously  described  ’  .  Cell  cultures  were  supplemented  with 
indicated  ligands  immediately  after  addition  of  the  calcium  phosphate/DNA  precipitate.  Media 
and  ligands  were  replaced  after  24  h  and  cells  were  harvested  and  assayed  for  CAT  activity  24  h 
later. 
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Legends  to  Figures: 

Figure  1:  Docking  of  a  known  RARa  specific  agonist.  The  RARa  selective  agonist  Am580  was 
docked  into  the  modeled  ligand  binding  pocket  of  RARa.  A:  The  complexed  ligand  (white 
sticks)  superimposes  with  the  crystal  structure  of  bound  all-trans  RA  (green).  Hydrogens  are  not 
shown  for  clarity.  B:  Am580  (CPK  display)  fits  tightly  into  the  receptor’s  pocket  (yellow  wire), 
but  for  a  ketone  oxygen,  which  shares  an  hydrogen  with  Ser234  of  the  receptor  (displayed  as 
stick).  The  receptor  in  the  vicinity  of  the  ligand  is  shown  as  a  white  ribbon.  Carbons,  hydrogens, 
oxygens  and  nitrogen  are  colored  white,  grey,  red  and  blue  respectively.  (Image  generated  with 


Molsoft  ICM) 


Figure  2:  In  vitro  activity  of  the  novel  RAR  agonists.  HeLa  cells  were  transfected  with  a  Gal4- 
hRARa-LBD  expression  vector  and  a  Gal4-CAT  reporter  gene.  The  CAT  activity  induced  by 
each  ligand  at  0.2, 1  and  10  pM  was  measured.  The  activity  induced  by  the  natural  hormone  all- 
trans  RA  at  100  nM  was  used  as  a  positive  control.  Similar  activity  was  observed  with  the  other 
two  RAR  isoforms. 

Figure  3:  Structure  of  the  novel  RAR  agonists.  Agonists  1,  2  and  3  are  shown  (respectively  A,  B 
and  C).  Left:  chemical  structure  of  the  compounds.  Right:  Representation  of  the  compounds 
docked  into  the  binding  pocket  of  RAR  (important  residues  are  displayed  as  sticks.  A:  R274, 
R278,  S289,  S234.  B:  R274,  R278,  S289.  C:  C237,  R278,  S234),  and  superimposed  with  the 
crystal  structure  of  all-trans  RA  (green).  The  receptor  is  represented  as  a  white  ribbon. 
Hydrogens  are  not  displayed  for  clarity.  Color  coding:  carbons,  oxygens,  nitrogens,  sulfurs, 
fluorides  and  hydrogens  are  colored  white,  red,  blue,  yellow,  magenta  and  grey  respectively. 


